PHARMACEUTICAL COMPOUNDS



This invention relates to novel inhibitors of protein tyrosine phosphatase (PTP) 
activity, to a novel PTP form and crystal structures thereof, and to the uses of the 
crystal structures and novel form intermediate in the design of new drug molecules.
The invention also provides the use of the inhibitors in medicine and in particular 
the treatment of disease states mediated by PTP activity, and to pharmaceutical 
compositions containing the compounds. 

Background of the Invention

Protein tyrosine phosphatases (PTPs) are crucial in regulating signal transduction
pathways involving tyrosine phosphorylation and have been implicated in cancer,
diabetes, rheumatoid arthritis, and hypertension.

Protein tyrosine phosphorylation plays a major role in the regulation of many cell
functions, including the response to hormones, growth factors and cytokines, as well as
in cell proliferation and apoptosis. Protein tyrosine phosphatases (PTPases)
therefore represent an important control point in these regulatory mechanisms.
Atypical tyrosine phosphorylation of specific proteins or components of signal
transduction pathways has been implicated in a variety of human diseases including
diabetes (type I and type II), obesity, autoimmune diseases, acute and chronic
inflammation, osteoporosis, proliferative disorders including various forms of
cancer, growth disorders, response to infection and defective platelet aggregation.
Such atypical tyrosine phosphorylation can result from dysregulation of the
kinases and/or the phosphatases controlling the process. In many cases PTPase activity
is the major mechanism limiting the extent of phosphorylation, and such
phosphatases therefore represent key targets for therapeutic agents intended to exert
pharmacological control over such processes.

Type 2 diabetes is characterized by abnormalities of insulin secretion and by insulin
resistance of the major target tissues, producing a diminished uptake and
metabolism of glucose.

Protein-tyrosine phosphatases (PTPases) play a key role in the regulation of
reversible tyrosine phosphorylation in the insulin action pathway. The receptor for
insulin is an integral membrane protein with tyrosine kinase activity, and insulin
signal transduction is initiated by the phosphorylation of specific tyrosyl residues
of the receptor. This initiates a complex signalling cascade leading to the phosphorylation
of several key substrates, including the IRS proteins, on specific tyrosine residues.
These activation steps are balanced, in turn, by specific cellular PTPases that
dephosphorylate and inactivate the receptor kinase and reverse the adapter function
of the receptor substrate proteins. PTP1B is a key component of this network and
levels of PTP1B have been reported to be increased in diabetes associated with
insulin resistance. Inhibition of PTP1B would therefore be expected to increase the
strength of the signal initiated by the insulin receptor and reverse the insulin
resistance in such patients.

Other cellular responses dependent on the action of tyrosine kinases are similarly
dependent on the phosphatases which limit the strength of the response. For
example, the growth factors EGF, VEGF and PDGF all initiate a network of
signalling cascades dependent on tyrosine phosphorylation by their specific
receptor tyrosine kinases. The responses of lymphocytes to specific antigen
activation and of other immune cells to cytokines such as IL-6 also use
non-receptor tyrosine kinases as key components in their signal transduction pathways.
Inhibition of other members of the PTP family would therefore be expected to
control cell growth, cellular transformation, tumor formation, lymphocyte
activation, cell migration, and inflammatory responses.

PTP1B belongs to a large family of PTPs characterised by an 11 residue signature
sequence (W)HCXAGXXR(S/T)G which includes the catalytic cysteine
(Cys215). Its catalytic mechanism involves a nucleophilic attack by Cys215 on a
phosphotyrosine substrate resulting in a covalent phosphocysteine intermediate,
which is subsequently hydrolysed by an activated water molecule. The crystal
structure of PTP1B shows that the PTP signature motif adopts a cradle-like
conformation forming the base of the active site (Figure 1a). Its backbone amide
atoms point to the centre of the cradle, which together with the invariant Arg221
provides an excellent environment to stabilise the negatively charged Cys215 side
chain and bind the phosphate moiety of an incoming substrate (Figure 1a). A
hydrogen bond between the hydroxyl group of Ser222 and the sulphur atom of
Cys215 further stabilises the cysteine conformation and helps to maintain its
reduced pKa (~5.4). One side of the active site is flanked by the so-called WPD-loop,
which adopts an open conformation in the unliganded enzyme and closes over
a bound phosphotyrosine residue. The opposite side is formed by the
phosphotyrosine (pTyr) recognition loop containing Tyr46, which mainly
determines specificity for phosphotyrosine substrates (Figure 1a).

Increasing literature evidence suggests that the cellular redox state is involved in
regulating PTP activity by reversibly oxidizing their catalytic cysteines. Current
literature describes the role of the sulfenic acids (Cys-SOH). Further oxidation to
the sulfinic (Cys-SO2H) and sulfonic (Cys-SO3H) forms causes irreversible
inhibition.

Further, there are examples of sulphur-nitrogen bonds in the literature. The reaction
of a sulfenic acid derivative of glyceraldehyde-3-phosphate dehydrogenase with the
small molecule benzylamine to form a sulfenamide has been reported by Allison et al.

Reich et al. have described studies involving the formation of a cyclic selenenamide
structure in a small molecule model system and suggested that in its oxidized form
mammalian glutathione peroxidase, a selenoenzyme, may have a cyclic
selenenamide structure.

Summary of the Invention 

The Applicants have found that oxidation of the catalytic cysteine at the active site
of PTP by oxidants leads to the formation of a sulfenyl amide moiety at the active
site. The 'sulfenyl amide' is an isothiazolidin-3-one ring system, which has not
been previously observed in proteins. The sulfenyl amide moiety is believed to be a
protective intermediate in the oxidative inhibition of PTPs that prevents further
irreversible oxidation to sulfinic and sulfonic acids. Formation of the sulfenyl
amide moiety at the active site leads to a loss of the enzyme's catalytic activity.
Reduction of the sulfenyl amide moiety with a physiological reducing agent such as
glutathione leads to regeneration of the active form of the enzyme.

This invention is based in part on the recognition that compounds that stabilize the
sulfenyl amide form, or effect reversible or irreversible covalent modification of the
sulfenyl amide form, will be useful as therapeutic agents. Thus, by preventing or
inhibiting the reversion of the inactive or less active sulfenyl amide form to the
active form of PTP, the overall level of activity of PTP within a cellular
environment can be substantially reduced.

Accordingly, the invention provides compounds that inhibit reversion of the PTP
sulfenyl amide to the active form of PTP and their use in therapy.

Also covered by the invention are cysteine-containing proteins which have a
suitably nucleophilic cysteine in the active site to facilitate formation of the sulfenyl
amide. This includes phosphatases and phosphatase-like proteins which are
structurally homologous to the PTPs, such as rhodanese and bacterial
phosphotransferases, e.g. IIBcel (also known as IIBchb).

The term "cysteine-containing proteins" includes all proteins characterised by the
HC(X5)R signature motif and other proteins belonging or related to this family,
for example, those that have a remnant of this motif capable of adopting a
conformation similar to the PTP phosphate binding cradle and which have a
catalytic cysteine. A preferred set of proteins is the set in which there is an active
site cysteine and an unusually polarised peptide bond between the active site
cysteine and the following residue, in particular those with the HC(X5)R signature
motif. One hypothesis is that in the HC(X5)R signature motif this bond is polarised
by the conserved His in the signature motif. The HC(X5)R phosphatase family
includes classical PTPs as well as the more distantly related low molecular weight
(LMW) phosphatases, dual specificity phosphatases and the rhodanese/CDC25
superfamily. A preferred subset is the set of those cysteine-containing proteins that
do not have more than one cysteine in the active site.
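
By way of illustration only, the signature motifs HC(X5)R and (W)HCXAGXXR(S/T)G can be expressed as regular expressions and located in an amino acid sequence. The following minimal Python sketch assumes that X denotes any residue and that the parenthesised W is optional; the function name `find_motifs` and the example fragment are hypothetical and are not taken from this specification.

```python
import re

# HC(X5)R: His, Cys, any five residues, then Arg (X assumed to mean "any residue").
HCX5R = re.compile(r"HC.{5}R")

# (W)HCXAGXXR(S/T)G: the leading Trp is assumed here to be optional.
PTP_SIGNATURE = re.compile(r"W?HC.AG..R[ST]G")


def find_motifs(sequence, pattern):
    """Return (1-based start position, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in pattern.finditer(sequence.upper())]


# Illustrative fragment only.
fragment = "AAVHCSAGIGRSGTAA"
print(find_motifs(fragment, PTP_SIGNATURE))
print(find_motifs(fragment, HCX5R))
```

A sequence matching the longer signature necessarily also matches HC(X5)R, which is consistent with the classical PTPs being described as members of the wider HC(X5)R family.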

PTPs of the invention characterised by the above structural motif include all PTPs.
In a preferred aspect of the invention this refers to PTPs without a second active site
cysteine in close proximity to the catalytic cysteine, and more preferably those
PTPs with one cysteine residue in the binding site. Preferred PTPs are PTPs
characterised by the 11 residue signature sequence (W)HCXAGXXR(S/T)G.
Preferred PTPs include LAR, T-cell PTP, PTP-α and PTP1B, more preferably
PTP1B.

The sulfenyl amide form of PTP1B has been prepared by the Applicants under
controlled conditions and its structure determined by X-ray diffraction analysis.
The structural data can be used in methods of rational drug design to provide
compounds that inhibit reversion of the PTP sulfenyl amide to the active form of
PTP.

Although the invention is specifically illustrated herein by reference to PTP1B, it is
considered to be applicable also to other PTPs where the cellular redox state is
involved in regulating PTP activity by reversibly oxidizing their catalytic cysteines
to form sulfenyl amides. The terms "protein tyrosine phosphatase sulfenyl amide",
"PTP sulfenyl amide" and "sulfenyl amide" used herein refer generally to PTPs in
which a cysteine moiety at the catalytic site has been oxidized to form a sulfenyl
amide, unless the context indicates otherwise. For the avoidance of doubt, it is
noted that the terms "protein tyrosine phosphatase sulfenyl amide", "sulfenyl amide
protein tyrosine phosphatase", "PTP sulfenyl amide" and "sulfenyl amide PTP" as
used herein are interchangeable and refer to the same entity unless the context
requires otherwise.

The various aspects and embodiments of the invention are described in more detail
below and defined in the claims appended hereto.

Brief Description of the Drawings 

Figure 1 provides a comparison of the structures of native and sulfenyl-amide
PTP1B.

Figure 1a is a ribbon diagram of PTP1B showing the phosphate-binding cradle, the
WPD-loop and the pTyr recognition loop.

Figure 1b shows the superimposition of the structure of native PTP1B (light grey)
and the sulfenyl-amide containing structure (dark grey) showing the different
conformations of the pTyr recognition loop and the phosphate-binding cradle.

Figure 1c shows the electron density of the novel sulfenyl-amide derivative and its
neighbouring residues. The electron density map in Figure 1c is contoured at 1σ.
All figures were generated using Aesop (Martin Noble, unpublished).

Figure 2 shows a putative mechanism of sulfenyl-amide formation and subsequent
reactivation. As illustrated, the catalytic cysteine of PTP1B (E-SH) is oxidised to a
sulfenic acid (E-S-OH). The sulfenyl-amide may be formed by a direct mechanism
involving a nucleophilic attack of the backbone nitrogen of Ser216 on the Sγ atom
of Cys215 and subsequent release of water. Alternatively, the sulfenic acid may be
oxidised to a highly reactive intermediate by an oxidising agent, e.g. a peroxide
such as H2O2 or an oxidised thiol, which then reacts to give the sulfenyl-amide.
Reactivation of the enzyme occurs via mixed disulfide formation with a thiol. R
denotes glutathione or DTT, and X the leaving groups OOH (sulfenoperoxoic acid) or
OS(O)R (sulfinothioic acid).

Figure 3 illustrates the different oxidation states of the catalytic cysteine. The
structures shown are the sulfonic (A), sulfinic (B) and sulfenic acid derivatives (C)
of Cys215. The phosphate-binding cradle comprising residues 215 to 222 is shown
in ball-and-stick representation. Hydrogen bonds are shown as dashed lines. The
maps are contoured at 3σ and in all maps the peaks are higher than 5σ.

Detailed Description of the Invention 

The PTP sulfenyl amides of the invention

In one aspect, the present invention contemplates an isolated PTP sulfenyl amide. 
The sulfenyl amides of the invention have a variety of uses, as described herein. 



The terms "PTP sulfenyl amide", "sulfenyl amide PTP" and (in the context of PTP) 
"sulfenyl amide" alone are used herein as generic terms to define any PTP (as 
defined below) in which a cysteine moiety at the catalytic site is oxidized to form a 
sulfenyl amide. 

The term "PTP" is used generally herein (and in particular in the context of the PTP
sulfenyl amides of the invention) as a generic term to include all members of the
PTP protein family, whether natural, synthetic or recombinant. Preferred are PTPs
without a second active site cysteine in close proximity to the catalytic cysteine, and
more particularly preferred are those PTPs with only one cysteine residue in the
binding site. Such preferred PTPs include LAR, T-cell PTP, PTP-α and PTP1B (the
latter specifically exemplified herein).

The term "PTP" is also intended to encompass PTP homologues, analogues, allelic
forms, species variants, derivatives, muteins or equivalents, whether natural,
synthetic or recombinant (as hereinbelow defined).

The term "homologue" is used herein in two distinct senses. It is used sensu stricto
to define proteins that share a common ancestor with the PTP. In this sense it covers
orthologues (species variants which have diverged in different organisms following
a speciation event) and paralogues (variants which have diverged within the same
organism after a gene duplication event). Thus, there is a direct evolutionary
relationship between the PTP and such homologues and this may be reflected in
structural and/or functional similarities. For example, orthologues may perform the
same role in each organism in which they are found, while paralogues may perform
functionally related (but distinct) roles within the same organism.

The term is also used herein sensu lato to define a PTP which is to some extent
structurally similar (i.e. not necessarily evolutionarily related and/or structurally and
functionally equivalent) to a given (reference) PTP (for example, to any one of
LAR, T-cell PTP, PTP-α and PTP1B). In this sense, homology is recognised on the
basis of purely structural criteria by the presence of amino acid sequence identities
and/or conservative amino acid changes and/or similar secondary, tertiary or
quaternary structures.

In this context, a conservative amino acid substitution is one in which the amino
acid residue is replaced with an amino acid residue having a similar side chain.
Families of amino acid residues having similar side chains have been defined in the
art (as set out for example by Dayhoff et al., Atlas of protein structure vol. 5,
National BioMed Fd'n, Washington D.C., 1979). These families include amino
acids with basic side chains (e.g. lysine, arginine, histidine), acidic side chains
(e.g. aspartic acid, glutamic acid), non-charged polar side chains (e.g. glycine,
asparagine, glutamine, serine, threonine, tyrosine, cysteine), non-polar side chains
(e.g. alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine,
tryptophan), beta-branched side chains (e.g. threonine, valine, isoleucine), and
aromatic side chains (e.g. tyrosine, phenylalanine, tryptophan, histidine).
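
The side-chain families listed above lend themselves to a simple lookup. The following Python sketch is illustrative only: it encodes the families exactly as enumerated in the preceding paragraph using one-letter residue codes, and treats a substitution as conservative when both residues fall within at least one common family. The names `FAMILIES` and `is_conservative`, and the decision to treat identical residues as "not a substitution", are assumptions made for the example.

```python
# Side-chain families as enumerated in the text (one-letter codes).
# A residue may belong to more than one family (e.g. Thr, Val, His).
FAMILIES = {
    "basic": set("KRH"),
    "acidic": set("DE"),
    "uncharged_polar": set("GNQSTYC"),
    "non_polar": set("AVLIPFMW"),
    "beta_branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def is_conservative(original, replacement):
    """True if two different residues share at least one side-chain family."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues: not a substitution at all
    return any(a in family and b in family for family in FAMILIES.values())


print(is_conservative("K", "R"))  # both basic
print(is_conservative("K", "D"))  # basic versus acidic
```

Because the families overlap, a pair such as threonine and valine counts as conservative through the beta-branched family even though the two residues sit in different polarity classes.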

The homologues of the invention therefore include proteins having at least 50%,
55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 95% sequence identity with the
reference PTP, and include truncated forms of naturally-occurring PTP proteins.
Such truncates are preferably at least 25%, 35%, 50% or 75% of the length of the
corresponding wild-type PTP and may have at least 50%, 55%, 60% or 65%
sequence identity (more preferably, at least 70%, 75%, 80%, 85%, 90% or 95%
sequence identity) with that wild-type PTP. Particularly preferred homologues are
truncates that contain a segment preferably comprising at least 8, 15, 20 or 30
contiguous amino acids that share at least 75%, 80%, 85%, 90% or 95% sequence
identity with that reference PTP.
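
As a worked illustration of the thresholds above, percentage sequence identity and relative truncate length can be computed from a pair of already-aligned sequences. The Python sketch below is a simplified assumption, not the method prescribed by this specification: it counts identical residues over the aligned columns in which neither sequence has a gap, and the function names and the `-` gap symbol are hypothetical choices for the example.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only the columns in which both sequences contribute a residue.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def length_fraction(truncate, wild_type):
    """Length of a truncate as a percentage of the wild-type length."""
    return 100.0 * len(truncate) / len(wild_type)


identity = percent_identity("AAAA", "AAAC")
print(identity, identity >= 75.0)
```

Other conventions for the denominator exist (for example the length of the shorter sequence or the full alignment length), so any reported identity should state which convention was used.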

Preferred truncates for PTP1B are residues 1-321 or residues 1-298 of the reference
wild-type sequence. A particularly preferred truncate for PTP1B is the one defined
by residues 1-321 of reference PTP1B sequence accession number P18031
[SwissProt: PTN1_HUMAN].



For the avoidance of doubt, full length sulfenyl amide wild-type PTPs are within 
the scope of the invention as well as truncated versions of sulfenyl amide wild-type 
PTPs. 

For the purposes of the invention, homologues may also be recognised as those
proteins the corresponding DNAs of which are capable of specifically or selectively
cross-hybridising, or which can cross-hybridise under selective, appropriate and/or
appropriately stringent hybridisation conditions.

The term "selectively or specifically (cross)hybridisable" in this context indicates
that the sequences of the corresponding ssDNAs are such that binding to a unique
sequence (or small class of homologous sequences) can be obtained under more or less
stringent hybridisation conditions. Exemplary stringent conditions can be found in,
for example, Current Protocols in Molecular Biology, John Wiley & Sons, N.Y.
(1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization
conditions is hybridization in 6x sodium chloride/sodium citrate (SSC) at about
45°C, followed by one or more washes in 0.2x SSC, 0.1% SDS at a temperature of
from about 50°C to 65°C.

The term "allelic form" is used herein to define a naturally-occurring alternative
form (allelic variant) of a wild-type PTP sequence which reflects naturally-occurring
differences in the PTP gene pool. Allelic forms may be isolated and
identified by making suitable probes or primers from the sequences provided herein
and screening a suitable nucleic acid source from individuals of the appropriate
species.

The term "analogue" is used herein to define proteins with similar functions and/or
structures and which are not necessarily evolutionarily related. PTP analogues
which share function but which have no or little structural similarities are likely to
have arisen by convergent evolution. Conversely, PTP analogues which share
structural similarities but which exhibit few or no functional similarities are likely
to have arisen by divergent evolution. PTP analogues may be identified, for
example, by screening a library of proteins to detect those with similar function(s)
but different physical properties, or by screening for proteins which share structural
features but not necessarily any functions (e.g. by immunological screening).

The term "species variant" is used herein to define the corresponding PTP from a 
different organism. Thus, species variants share a direct evolutionary relationship. 

Species variants may be isolated and identified by making suitable probes or
primers from the sequences provided herein and screening a suitable nucleic acid
source from the desired species. Preferably, species variants are those isolated from
mammalian species. Most preferably, species variants are those isolated from
certain mammalian species such as, for example, Pan troglodytes, Gorilla gorilla,
Pongo pygmaeus, Hylobates concolor, Macaca mulatta, Papio papio, Papio
hamadryas, Cercopithecus aethiops, Cebus capucinus, Aotus trivirgatus, Saguinus
oedipus, Microcebus murinus, Mus musculus, Rattus norvegicus, Cricetulus
griseus, Felis catus, Mustela vison, Canis familiaris, Oryctolagus cuniculus, Bos
taurus, Ovis aries, Sus scrofa, and Equus caballus, for which genetic maps have
been created allowing the identification of syntenic relationships between the
genomic organization of genes in one species and the genomic organization of the
related genes in another species (O'Brien and Seuanez, 1988, Ann. Rev. Genet. 22:
323-351; O'Brien et al., 1993, Nature Genetics 3: 103-112; Johansson et al., 1995,
Genomics 25: 682-690; Lyons et al., 1997, Nature Genetics 15: 47-56; O'Brien et
al., 1997, Trends in Genetics 13 (10): 393-399; Carver and Stubbs, 1997,
Genome Research 7: 1123-1137; all of which are incorporated by reference herein).

The term "derivative" as applied herein to the PTPs of the invention is used to
define PTPs which are modified versions of any wild-type or truncated PTP. Such
derivatives may include fusion proteins, in which the proteins of the invention have
been fused to one or more different proteins, peptides or amino acid tags (for
example an antibody or a protein domain conferring a biochemical activity, to act as
a label, or to facilitate purification). Particularly preferred are derivatives in which
the PTP proteins or peptides are modified by a polyHis (6xHis) tag to facilitate
purification of the peptide derivative on Ni agarose beads. It is further preferred
that the proteins are derivatives of truncated PTP proteins.

The derivatives may also be products of synthetic processes that use a wild-type
PTP as a starting material or reactant.

The term "mutein" is used herein to define PTPs that are mutant forms of a wild-type
PTP, i.e. PTP proteins in which one or more amino acids have been added,
altered, deleted, replaced, inserted or substituted. The muteins of the invention
therefore include fragments, truncates and fusion peptides (e.g. comprising fused
immunoglobulin, receptor, tag, label or enzyme moieties).

The muteins of the invention therefore include truncated forms of a wild-type PTP.
Such truncates are preferably at least 25%, 35%, 50% or 75% of the length of the
corresponding wild-type PTP and may have at least 65% sequence identity (more
preferably, at least 70%, 75%, 80%, 85%, 90% or 95% sequence identity) with that
PTP.

The muteins of the invention also include PTPs in which mutations have been
introduced which effectively promote or impair one or more activities of the PTP,
for example mutations which promote or impair the function of the active site.

Muteins may be produced by any convenient method. Conveniently, site-directed
mutagenesis with mutagenic oligonucleotides may be employed using a double
stranded template (pBluescript KS II construct containing a PTP gene) (e.g.
Chameleon™ or QuikChange™, Stratagene™). After verifying each mutant
derivative by sequencing, the mutated gene is excised and inserted into a suitable
vector so that the modified protein can be over-expressed and purified.

Preferred mutant forms are truncates consisting (or consisting essentially) of the
PTP 11-residue signature sequence described herein. Particularly preferred are
truncates that contain a segment preferably comprising at least 8, 15, 20 or 30
contiguous amino acids that share at least 75%, 80%, 85%, 90% or 95% sequence
identity with the PTP from which they are derived by truncation.

The term "equivalent" is used herein to define those PTP analogues which exhibit
substantially the same function(s) and which share at least some structural features
(e.g. functional domains), but which have not evolved from a common ancestor.
Such equivalents are typically synthetic proteins (see below) and may be generated,
for example, by identifying sequences of functional importance (e.g. by identifying
conserved or canonical sequences, functional domains or by mutagenesis followed
by functional assay), selecting an amino acid sequence on that basis and then
synthesising a peptide based on the selected amino acid sequence. Such synthesis
can be achieved by any of many different methods known in the art, including solid
phase peptide synthesis (to generate synthetic peptides) and the assembly (and
subsequent cloning) of oligonucleotides. Some synthetic protein analogues may be
chimaeras, and such equivalents can be designed and assembled for example by
concatenation of two or more different structural and/or functional peptide domains
from different proteins using recombinant DNA techniques.

The homologues, analogues, fragments, muteins, equivalents or derivatives of the
PTPs of the invention may also be defined inter alia as those proteins which
cross-react with antibodies to one or more wild-type PTPs, and in particular those which
cross-react with antibodies directed against a PTP lacking a second active site
cysteine in close proximity to the catalytic cysteine (for example a PTP with only
one cysteine residue in the binding site). Thus, the homologues, fragments,
muteins, equivalents or derivatives of the PTPs of the invention include proteins
which cross-react with antibodies to one or more of LAR, T-cell PTP, PTP-α and
PTP1B.

For the purposes of the present invention, sequence identity is determined by
comparing the amino acid sequences of the proteins when aligned so as to maximize
overlap and identity while minimizing sequence gaps. In particular, sequence
identity may be determined using any of a number of mathematical algorithms. A
nonlimiting example of a mathematical algorithm used for comparison of two
sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci.
USA 87: 2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad.
Sci. USA 90: 5873-5877.

Another example of a mathematical algorithm used for comparison of sequences is
the algorithm of Myers and Miller (1988) CABIOS 4: 11-17. Such an algorithm is
incorporated into the ALIGN program (version 2.0) which is part of the GCG
sequence alignment software package. When utilizing the ALIGN program for
comparing amino acid sequences, a PAM120 weight residue table, a gap length
penalty of 12, and a gap penalty of 4 can be used. Yet another useful algorithm for
identifying regions of local sequence similarity and alignment is the FASTA
algorithm as described in Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA
85: 2444-2448.

Preferred for use according to the present invention is the WU-BLAST
(Washington University BLAST) version 2.0 software. WU-BLAST version 2.0
executable programs for several UNIX platforms can be downloaded from
ftp://blast.wustl.edu/blast/executables. This program is based on WU-BLAST
version 1.4, which in turn is based on the public domain NCBI-BLAST version 1.4
(Altschul and Gish, 1996, Local alignment statistics, Doolittle ed., Methods in
Enzymology 266: 460-480; Altschul et al., 1990, Basic local alignment search tool,
Journal of Molecular Biology 215: 403-410; Gish and States, 1993, Identification of
protein coding regions by database similarity search, Nature Genetics 3: 266-272;
Karlin and Altschul, 1993, Applications and statistics for multiple high-scoring
segments in molecular sequences, Proc. Natl. Acad. Sci. USA 90: 5873-5877; all of
which are incorporated by reference herein).

In all search programs in the suite the gapped alignment routines are integral to the
database search itself. Gapping can be turned off if desired. The default penalty (Q)
for a gap of length one is Q=9 for proteins and BLASTP, and Q=10 for BLASTN,
but may be changed to any integer. The default per-residue penalty for extending a
gap (R) is R=2 for proteins and BLASTP, and R=10 for BLASTN, but may be
changed to any integer. Any combination of values for Q and R can be used in
order to align sequences so as to maximize overlap and identity while minimizing
sequence gaps. The default amino acid comparison matrix is BLOSUM62, but other
amino acid comparison matrices such as PAM can be utilized.
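
To make the roles of Q and R concrete, the following Python sketch computes the total penalty for a single gap under one reading of the paragraph above: Q is charged for a gap of length one and R for each additional residue by which the gap is extended. This is an illustrative assumption about how the two parameters combine, not a statement of how the WU-BLAST software computes scores internally; the function name `gap_cost` is hypothetical.

```python
def gap_cost(length, q, r):
    """Total penalty for one gap: Q for the first position, R per extension."""
    if length < 1:
        raise ValueError("gap length must be at least 1")
    return q + r * (length - 1)


# Default values quoted in the text.
PROTEIN_Q, PROTEIN_R = 9, 2    # proteins and BLASTP
BLASTN_Q, BLASTN_R = 10, 10    # BLASTN

for n in (1, 2, 3):
    print(n, gap_cost(n, PROTEIN_Q, PROTEIN_R), gap_cost(n, BLASTN_Q, BLASTN_R))
```

With the protein defaults, opening a gap is far more expensive than extending one, so a single long gap is penalised less than several short gaps; with the BLASTN defaults (Q equal to R) the cost is simply proportional to gap length.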

The term "isolated" is used herein to indicate that the PTP sulfenyl amide exists in a
physical milieu distinct from that in which it occurs in nature. For example, the
isolated sulfenyl amide may be substantially isolated with respect to the complex
cellular milieu in which it naturally occurs. The absolute level of purity is not
critical, and those skilled in the art can readily determine appropriate levels of
purity according to the use to which the PTP sulfenyl amide is to be put. The term
"isolating" when used as a step in a process is to be interpreted accordingly.

In many circumstances, the isolated PTP sulfenyl amide will form part of a
composition, for example a more or less crude extract containing many other
molecules and substances, buffer systems, matrices or excipients, which may for
example contain other components (including assay reagents and proteins, such as
albumin).

In other circumstances, the isolated PTP sulfenyl amide may be purified to essential
homogeneity, for example as determined by PAGE or column chromatography (for
example HPLC or mass spectrometry). In preferred embodiments, the isolated PTP
sulfenyl amide is essentially the sole protein in a given composition.

The isolated PTP sulfenyl amide of the invention may be crystallized. Crystals of
the isolated PTP sulfenyl amide find particular utility in some applications of the
invention (for example, for the in silico analyses described below).

The PTP sulfenyl amides of the invention need not be isolated in the sense defined
above, however. For example, more or less crude preparations derived from spent
media used to culture host cells expressing PTP or PTP sulfenyl amide may be
used. Such supernatants may be treated in various ways, for example by oxidation,
concentration, filtration, centrifugation, spray drying, dialysis and/or lyophilisation.

The term "natural" is used herein to define a PTP that has been derived from a
naturally-occurring wild-type protein source by chemical or enzymic treatment.
Naturally occurring PTPs may be obtained by purification (e.g. by column
chromatography) from cellular material in which the native PTP is expressed.

The term "synthetic PTP" is used herein to define a PTP that has been chemically
synthesised in vitro (for example by any of the commercially available solid-phase
peptide-synthesis systems).

The term "recombinant" is used herein to define a PTP that has been produced by 
that body of techniques collectively known as "recombinant DNA technology". 

Thus, although this invention is based (at least in part) on the identification and
characterization of a novel sulfenyl amide intermediate arising from oxidation of a
cysteine moiety at the active site of PTP1B (see below), any of a large number of
different PTP sulfenyl amides are contemplated by the invention and each finds
general application in the various methods and processes described herein.

Use of the PTP sulfenyl amides of the invention per se in Drug Discovery

The PTP sulfenyl amides of the invention may be used inter alia in various drug 
screening processes. 

For example, the invention provides a process for screening for a PTP inhibitor 
comprising the steps of: (a) providing the PTP sulfenyl amide (or a homologue,
allelic form, species variant, derivative or mutein thereof); (b) contacting the 
sulfenyl amide of step (a) with a test compound; and (c) determining whether the 
test compound binds to the sulfenyl amide. 

The screening processes of the invention as described above are preferably high
throughput processes. The screens identify and/or select compounds with PTP
sulfenyl amide binding activity. Such compounds are candidate PTP modulators,
and can be subjected to further analysis and/or screening in order to determine their
activity as therapeutic agents (see for example the section headed "Assays for
Screening for Active Compounds", below). Alternatively, or in addition, they may
be crystallized with PTP1B sulfenyl amide (e.g. by co-crystallization or by soaking)
for X-ray analysis. The resulting X-ray structure may be compared with that of
Table 1 or Table 2 for a variety of purposes.

For example, the PTP sulfenyl amides of the invention may be used in a process for
producing a PTP inhibitor comprising the steps of: (a) providing a PTP sulfenyl
amide (or a homologue, allelic form, species variant, derivative or mutein thereof);
(b) contacting the sulfenyl amide of step (a) with a test compound; (c) determining
whether the test compound binds to the sulfenyl amide; and (d) identifying the test
compound as a PTP inhibitor on the basis of its ability to prevent or inhibit the
reductive activation of the PTP sulfenyl amide to active PTP.

In such processes, at least two chemically distinct test compounds may be identified
in step (d) and the process may then further comprise the step of linking two or 
more of the chemically distinct compounds to produce a multimeric PTP inhibitor. 
Such processes embody the linked fragment approaches described in more detail in 
the section headed "Linked fragment and fragment growing approaches", below. 

The PTP sulfenyl amides of the invention may therefore be used in a process for
producing a pharmaceutical composition comprising the steps of: (a) providing a
PTP sulfenyl amide (or a homologue, allelic form, species variant, derivative or
mutein thereof); (b) contacting the sulfenyl amide of step (a) with a test compound;
(c) determining whether the test compound binds to the sulfenyl amide; (d)
identifying the test compound as a PTP inhibitor on the basis of its ability to
prevent or inhibit the reductive activation of the PTP sulfenyl amide to active PTP;
and (e) incorporating the inhibitor identified in step (d) into a pharmaceutical
excipient.

The invention contemplates PTP inhibitors, drugs and pharmaceutical
compositions obtainable by, or obtained by, the process of the invention described
above.

Identification and Characterization of the Three Dimensional Structure of Sulfenyl
Amide PTP

This invention is based on the identification and characterization of a novel sulfenyl 
amide intermediate arising from oxidation of a cysteine moiety at the active site of a
protein tyrosine phosphatase. 

The catalytic domain (residues 1-321) of PTP1B was expressed in E. coli cells
according to known procedures and was purified and crystallized. The oxidation
state of the catalytic cysteine of PTP1B was probed by means of soaking
experiments using various oxidizing agents and crystal structures were
subsequently obtained for a novel sulfenyl-amide intermediate of PTP1B, as well as
sulfenic, sulfinic and sulfonic PTP1B derivatives.

Soaking crystals of the catalytic domain of PTP1B with
2-phenyl-isoxazolidine-3,5-dione gave rise to a modified crystal structure (the
PTP1B sulfenyl amide), the structure for which has been determined by X-ray
diffraction analysis. Atomic coordinates of the catalytic domain are set out in
Table 1 and Table 2.

Accordingly, in one aspect, the invention provides a crystal of sulfenyl amide
protein tyrosine phosphatase 1B.

Crystals

In another aspect, the invention provides a crystal of sulfenyl amide protein tyrosine
phosphatase 1B having unit cell dimensions a = 87.686 Å, b = 87.686 Å, c =
103.721 Å, α = 90.00°, β = 90.00°, γ = 120.00° and space group P3₁21. Unit
cell variability of 5% may be observed in all dimensions.
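The 5% variability criterion can be illustrated with a minimal sketch. It assumes that the tolerance applies to the edge lengths a, b and c only and that the angles are fixed by the space group; the text does not spell this out, and the function and variable names are hypothetical.

```python
# Reference cell quoted in the text (lengths in angstroms, angles in degrees).
REFERENCE_CELL = {"a": 87.686, "b": 87.686, "c": 103.721,
                  "alpha": 90.0, "beta": 90.0, "gamma": 120.0}


def within_cell_tolerance(cell, reference=REFERENCE_CELL, tolerance=0.05):
    """Return True if every edge length is within `tolerance` of the reference.

    Assumption: the 5% variability applies to a, b and c only, and the
    angles must match the reference values.
    """
    for edge in ("a", "b", "c"):
        if abs(cell[edge] - reference[edge]) > tolerance * reference[edge]:
            return False
    for angle in ("alpha", "beta", "gamma"):
        if abs(cell[angle] - reference[angle]) > 1e-6:
            return False
    return True
```

A cell with a = b = 88.5 Å and c = 104.9 Å would pass this check, whereas a cell with a = 95.0 Å would not.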

The invention also provides a crystal of sulfenyl amide protein tyrosine
phosphatase 1B having a resolution better than, i.e. numerically lower than, 3.0 Å,
preferably lower than 2.6 Å.

The invention also provides crystals of sulfenyl amide protein tyrosine phosphatase
1B capable of being soaked with compound(s) to form co-complex structures.

Table 1 coordinates 

The invention also provides a crystal of sulfenyl amide protein tyrosine phosphatase
1B having the structure defined by the coordinates of Table 1.

Table 1 gives atomic coordinate data for sulfenyl amide protein tyrosine
phosphatase 1B. In Table 1 the third column denotes the atom type, the fourth the
residue type, the fifth the chain identification, the sixth the residue number. The
seventh, eighth and ninth columns are the X, Y, Z coordinates respectively of the
atom in question, the tenth column defines the occupancy of the atom, the eleventh
column gives the temperature factor of the atom.
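The column layout described above can be read with a short parser. This is a sketch only: it assumes the rows are whitespace-separated, that the first two columns (not described in the text) hold a record label and a serial number, and the example row is invented for illustration and is not taken from Table 1.

```python
from typing import NamedTuple


class AtomRecord(NamedTuple):
    atom_type: str
    residue_type: str
    chain_id: str
    residue_number: int
    x: float
    y: float
    z: float
    occupancy: float
    temperature_factor: float


def parse_coordinate_row(line: str) -> AtomRecord:
    """Parse one whitespace-separated row using the column order in the text."""
    fields = line.split()
    if len(fields) < 11:
        raise ValueError("expected at least 11 columns")
    return AtomRecord(
        atom_type=fields[2],            # third column
        residue_type=fields[3],         # fourth column
        chain_id=fields[4],             # fifth column
        residue_number=int(fields[5]),  # sixth column
        x=float(fields[6]),             # seventh to ninth columns: X, Y, Z
        y=float(fields[7]),
        z=float(fields[8]),
        occupancy=float(fields[9]),     # tenth column
        temperature_factor=float(fields[10]),  # eleventh column
    )


# Made-up row for illustration only; it is not taken from Table 1.
example_row = "ATOM 1 CA CYS A 215 10.000 20.000 30.000 1.00 25.00"
record = parse_coordinate_row(example_row)
```

Splitting on whitespace fails if adjacent columns run together, so a fixed-width reader may be preferable for real coordinate files.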

Table 2 coordinates 

The invention further provides a crystal of sulfenyl amide protein tyrosine
phosphatase 1B having the structure defined by the coordinates of Table 2.

The coordinates of Table 2 are defined as the coordinates of Table 1 as amended in
the manner outlined in Table 2. Table 2 varies in six ways from Table 1. These
changes do not affect the positioning of the atoms of sulfenyl amide protein
tyrosine phosphatase 1B. The co-ordinates of Table 2 represent the same spatial
distribution of atoms of sulfenyl amide protein tyrosine phosphatase 1B as
contained in Table 1 but in a format consistent with that of the EBI Macromolecular
Structure Database (Hinxton, UK). Thus, the invention covers all co-ordinate files
that essentially represent the same spatial distribution of sulfenyl amide protein
tyrosine phosphatase 1B atoms independent of file format.

The coordinates of Table 1 or Table 2 provide a measure of atomic location in
Angstroms, to a third decimal place. The coordinates are a relative set of positions
that define a shape in three dimensions, so it is possible that an entirely different set
of coordinates having a different origin and/or axes could define a similar or
identical shape. Furthermore, varying the relative atomic positions of the atoms of
the structure so that the root mean square deviation of the residue backbone atoms
(i.e. the nitrogen-carbon-carbon backbone atoms of the protein amino acid residues)
or the C-alpha atoms is less than 1.5 Å (preferably less than 1.0 Å, more preferably
less than 0.5 Å and even more preferably less than 0.47 Å) when superimposed on
the coordinates provided in Table 1 or Table 2 for the residue backbone atoms, will
generally result in a structure which is substantially the same as the structure of
Table 1 or Table 2 in terms of both its structural characteristics and potency for
structure-based design of PTP1B inhibitors.

Likewise changing the number and/or positions of the water molecules and/or
substrate molecules of Table 1 or Table 2 will not generally affect the potency of
the structure for structure-based design of PTP1B inhibitors. Thus for the purposes
described herein as being aspects of the present invention, it is within the scope of
the invention if: the Table 1 or Table 2 coordinates are transposed to a different
origin and/or axes; the relative atomic positions of the atoms of the structure are
varied so that the root mean square deviation of residue backbone atoms or the
C-alpha atoms is less than 1.5 Å (preferably less than 1.0 Å, more preferably less than
0.5 Å and even more preferably less than 0.47 Å) when superimposed on the
coordinates provided in Table 1 or Table 2 for the residue backbone atoms; and/or
the number and/or positions of water molecules and/or substrate molecules is
varied. References herein to the coordinate data of Table 1 or Table 2 thus include
the coordinate data in which one or more individual values of the Tables are varied
in this way. By "root mean square deviation" we mean the square root of the
arithmetic mean of the squares of the deviations from the mean.

It is also within the scope of the invention if the coordinate file represents the same
spatial distribution of sulfenyl amide protein tyrosine phosphatase 1B atoms but in a
different file format. Alternative file formats (e.g. a format consistent with
that of the EBI Macromolecular Structure Database (Hinxton, UK)) which may
include a different ordering of these atoms, or a different designation of the residues
or residue molecule atoms, may be used or preferred by others of skill in the art.
However it will be apparent that the use of a different file format to present or
manipulate the coordinates of the Tables is within the scope of the present
invention. Thus for the purposes described herein as being aspects of the present
invention, it is within the scope of the invention if the coordinates are essentially
the same as Table 1 or Table 2, essentially comprise the coordinates of Table 1 or
Table 2, or are a set of coordinates that materially correspond to those of Table 1 or
Table 2.

Other crystals of the invention include crystals which have selected coordinates of
the binding pocket, wherein the amino acid residues associated with those selected
coordinates are located in a protein framework which holds these amino acids in a
relative spatial configuration corresponding to the spatial configuration of those
amino acids in Table 1 or Table 2. By "corresponding to", it is meant within an
r.m.s.d. of less than 2.0 Å, preferably less than 1.5 Å, more preferably less than 1.0
Å, even more preferably less than 0.5 Å and most preferably less than 0.47 Å. In a
further embodiment it is within an r.m.s.d. of less than 0.3 Å, less than 0.25 Å, or
less than 0.2 Å, and most preferably less than 0.1 Å. The amino acids which
provide the selected coordinates are preferably selected from amino acids which
form part of at least one sulfenyl amide protein tyrosine phosphatase 1B binding
cavity, where these are residues 1 to 56 as described herein or combinations thereof
as defined further herein below.

Those of skill in the art will appreciate that in many applications of the invention, it
is not necessary to utilise all the coordinates of Table 1 or Table 2, but merely a
portion of them. For example, as described below, in methods of modelling
candidate compounds with PTP sulfenyl amide, selected coordinates of PTP
sulfenyl amide may be used, for example at least 5, preferably at least 10, more
preferably at least 20 and even more preferably at least 100 atoms of the sulfenyl
amide structure. Likewise, the other applications of the invention described herein,
including homology modelling and structure solution, and data storage and
computer assisted manipulation of the coordinates, may also utilise all or a portion
of the coordinates of Table 1 or Table 2. A preferred aspect of the invention is
where the portion of the coordinates relates to the selected coordinates of the
binding pocket. The amino acids which provide the selected coordinates are
preferably selected from amino acids which form part of at least one sulfenyl amide
protein tyrosine phosphatase 1B binding cavity, where these are residues 1 to 56 as
described herein or combinations thereof as defined further herein below.

It will also be appreciated that the invention also includes within its scope
crystals of PTP sulfenyl amide comprising amino acids having the atomic
coordinates of Tables 1 or 2, but wherein the crystal comprises further amino acids
in addition to those for which the coordinates are given. Therefore, unless
explicitly set out to the contrary, or otherwise made clear from the context,
references throughout the present specification to the use of all or selected
coordinates of or from Tables 1 or 2 do not exclude the use of additional
coordinates due to the presence of further amino acids.

Comparison of protein structures 

Protein structure similarity is routinely expressed and measured by the root mean
square deviation (r.m.s.d.), which measures the difference in positioning in space
between two sets of atoms. The r.m.s.d. measures distance between equivalent
atoms after their optimal superposition. The r.m.s.d. can be calculated over all
atoms, over residue backbone atoms (i.e. the nitrogen-carbon-carbon backbone
atoms of the protein amino acid residues), main chain atoms only (i.e. the
nitrogen-carbon-oxygen-carbon backbone atoms of the protein amino acid residues), side
chain atoms only or more usually over C-alpha atoms only. For the purposes of this
invention, the r.m.s.d. can be calculated over any of these, using any of the methods
outlined below.
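As an illustration of the quantity itself, the following minimal sketch computes the r.m.s.d. between two lists of equivalent atoms. It assumes that the two coordinate sets have already been optimally superposed and are listed in the same order; the superposition step is not performed here, and the function name is hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally ordered lists of (x, y, z).

    Assumption: the two sets are already superposed; no fitting is done here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared distance between one pair of equivalent atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))
```

The same function can be applied to all atoms, backbone atoms or C-alpha atoms only, depending on which coordinates are passed in.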

Methods of comparing protein structures are discussed in Methods in Enzymology,
vol. 115, pg 397-420. The necessary least-squares algebra to calculate r.m.s.d. has
been given by Rossmann and Argos (J. Biol. Chem., vol. 250, pp7525 (1975))
although faster methods have been described by Kabsch (Acta Crystallogr., Section
A, A32, 922 (1976); Acta Cryst. A34, 827-828 (1978)), Hendrickson (Acta
Crystallogr., Section A, A35, 158 (1979)), McLachlan (J. Mol. Biol., vol 128, pp49
(1979)) and Kearsley (Acta Crystallogr., Section A, A45, 208 (1989)). Some
algorithms use an iterative procedure in which one molecule is moved relative
to the other, such as that described by Ferro and Hermans (Ferro and Hermans, Acta
Crystallographica, A33, 345-347 (1977)). Other methods, e.g. Kabsch's algorithm,
locate the best fit directly.

Programs for determining r.m.s.d. include MNYFIT (part of a collection of
programs called COMPOSER, Sutcliffe, M.J., Haneef, I., Carney, D. and Blundell,
T.L. (1987) Protein Engineering, 1, 377-384) and MAPS (Lu, G. An Approach for
Multiple Alignment of Protein Structures (1998, in manuscript and on
http://bioinfol.mbfys.lu.se/TOP/maps.html)).

It is usual to consider C-alpha atoms and the r.m.s.d. can then be calculated using
programs such as LSQKAB (Collaborative Computational Project 4. The CCP4
Suite: Programs for Protein Crystallography, Acta Crystallographica, D50, (1994),
760-763), QUANTA (commercially available from Accelrys, San Diego, CA),
Insight (commercially available from Accelrys, San Diego, CA), Sybyl®
(commercially available from Tripos, Inc., St Louis), O (Jones et al., Acta
Crystallographica, A47, (1991), 110-119), and other coordinate fitting programs.

In, for example, the programs LSQKAB and O, the user can define the residues in
the two proteins that are to be paired for the purpose of the calculation.
Alternatively, the pairing of residues can be determined by generating a sequence
alignment of the two proteins; programs for sequence alignment are discussed in
more detail above. The atomic coordinates can then be superimposed according to
this alignment and an r.m.s.d. value calculated. The program Sequoia (C.M. Bruns,
I. Hubatsch, M. Ridderstrom, B. Mannervik, and J. A. Tainer (1999) Human
Glutathione Transferase A4-4 Crystal Structures and Mutagenesis Reveal the Basis
of High Catalytic Efficiency with Toxic Lipid Peroxidation Products, Journal of
Molecular Biology 288(3): 427-439) performs the alignment of homologous protein
sequences, and the superposition of homologous protein atomic coordinates. Once
aligned, the r.m.s.d. can be calculated using programs detailed above. For
identical or highly similar sequences, the structural alignment of proteins can be
done manually or automatically as outlined above. Another approach would be to
generate a superposition of protein atomic coordinates without considering the
sequence.

It is more normal when comparing significantly different sets of coordinates to
calculate the r.m.s.d. value over C-alpha atoms only. It is particularly useful when
analysing side chain movement to calculate the r.m.s.d. over all atoms and this can
be done using LSQKAB and other programs.
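The pairing of residues from a sequence alignment can be sketched as follows. The sketch assumes the alignment is supplied as two gapped strings of equal length with "-" as the gap character; it is not the procedure used by LSQKAB, O or Sequoia, and the function name is hypothetical.

```python
def paired_positions(aligned_a, aligned_b, gap="-"):
    """Return (index_in_a, index_in_b) pairs for columns where neither row has a gap.

    Indices are zero-based positions in the ungapped sequences.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = []
    i = j = 0  # running positions in the ungapped sequences
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a != gap and res_b != gap:
            pairs.append((i, j))
        if res_a != gap:
            i += 1
        if res_b != gap:
            j += 1
    return pairs
```

The resulting index pairs identify the equivalent residues whose atoms would then be superimposed before an r.m.s.d. value is calculated.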

Mutants

Also, modifications in the sulfenyl amide protein tyrosine phosphatase 1B crystal
structure due to e.g. mutations, additions, substitutions, and/or deletions of amino
acid residues could account for variations in the atomic coordinates. However,
atomic coordinate data of sulfenyl amide protein tyrosine phosphatase 1B modified
so that a ligand that bound to one or more binding sites of sulfenyl amide protein
tyrosine phosphatase 1B would be expected to bind to the corresponding binding
sites of the modified sulfenyl amide protein tyrosine phosphatase 1B are, for the
purposes described herein as being aspects of the present invention, also within the
scope of the invention. References herein to the coordinates of Table 1 or Table 2
thus include the coordinates modified in this way. Preferably, the modified
coordinate data define at least one sulfenyl amide protein tyrosine phosphatase 1B
binding cavity.

Crystals of the invention also include crystals of sulfenyl amide protein tyrosine
phosphatase 1B mutants. In addition, sulfenyl amide protein tyrosine phosphatase
1B mutants may be crystallized in co-complex with known sulfenyl amide protein
tyrosine phosphatase 1B substrates or inhibitors or novel compounds.

As explained herein, a mutant sulfenyl amide protein tyrosine phosphatase 1B is a
sulfenyl amide protein tyrosine phosphatase 1B protein characterized by the
replacement or deletion of at least one amino acid from the wild type PTP1B. Such
a mutant may be prepared for example by site-specific mutagenesis, or
incorporation of natural or unnatural amino acids.

As explained herein, the present invention therefore contemplates sulfenyl amide
protein tyrosine phosphatase 1B mutants as hereinbefore defined.

For example, the sulfenyl amide protein tyrosine phosphatase 1B mutants may
define a polypeptide which is obtained by replacing at least one amino acid residue
in a native or synthetic sulfenyl amide protein tyrosine phosphatase 1B with a
different amino acid residue and/or by adding and/or deleting amino acid residues
within the native polypeptide or at the N- and/or C-terminus of a polypeptide
corresponding to sulfenyl amide protein tyrosine phosphatase 1B, and which has
substantially the same three-dimensional structure as sulfenyl amide protein
tyrosine phosphatase 1B from which it is derived. By having substantially the same
three-dimensional structure is meant having a set of atomic structure co-ordinates
that have a root mean square deviation (r.m.s.d.) of less than or equal to about 1.5
Å, preferably less than 0.47 Å, when superimposed with the atomic structure
co-ordinates of the sulfenyl amide protein tyrosine phosphatase 1B from which the
mutant is derived when at least about 50% to 100% of the Cα atoms of the sulfenyl
amide protein tyrosine phosphatase 1B are included in the superposition. A mutant
may have, but need not have, enzymatic or catalytic activity.

To produce homologues or mutants, amino acids present in the said protein can be
replaced by other amino acids having similar properties, for example
hydrophobicity, hydrophobic moment, antigenicity, propensity to form or break
α-helical or β-sheet structures, and so on. Substitutional variants of a protein are
those in which at least one amino acid in the protein sequence has been removed and a
different residue inserted in its place. Amino acid substitutions are typically of
single residues but may be clustered depending on functional constraints e.g. at a
crystal contact. Preferably amino acid substitutions will comprise conservative
amino acid substitutions. Insertional amino acid variants are those in which one or
more amino acids are introduced. This can be amino-terminal and/or
carboxy-terminal fusion as well as intrasequence. Examples of amino-terminal and/or
carboxy-terminal fusions are affinity tags, MBP tag, and epitope tags.

Amino acid substitutions, deletions and additions that do not significantly interfere
with the three-dimensional structure of the sulfenyl amide protein tyrosine
phosphatase 1B will depend, in part, on the region of the sulfenyl amide protein
tyrosine phosphatase 1B where the substitution, addition or deletion occurs. In
highly variable regions of the molecule, non-conservative substitutions as well as
conservative substitutions may be tolerated without significantly disrupting the
three-dimensional structure of the molecule. In highly conserved regions, or
regions containing significant secondary structure, conservative amino acid
substitutions are preferred.

As explained earlier, conservative amino acid substitutions are well known in the
art, and include substitutions made on the basis of similarity in polarity, charge,
solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the
amino acid residues involved. For example, negatively charged amino acids
include aspartic acid and glutamic acid; positively charged amino acids include
lysine and arginine; amino acids with uncharged polar head groups having similar
hydrophilicity values include the following: leucine, isoleucine, valine; glycine,
alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. Other
conservative amino acid substitutions are well known in the art.
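The groupings listed above can be encoded as a simple lookup. The sketch below is illustrative only: it uses the standard one-letter amino acid codes, covers just the groups named in this paragraph, and treats any pair outside those groups as non-conservative, which is stricter than the text (other conservative substitutions are known in the art). The names are hypothetical.

```python
# Groups taken from the paragraph above, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"D", "E"},       # aspartic acid, glutamic acid
    {"K", "R"},       # lysine, arginine
    {"L", "I", "V"},  # leucine, isoleucine, valine
    {"G", "A"},       # glycine, alanine
    {"N", "Q"},       # asparagine, glutamine
    {"S", "T"},       # serine, threonine
    {"F", "Y"},       # phenylalanine, tyrosine
]


def is_conservative(original, replacement):
    """Return True if both residues differ and fall in the same listed group."""
    if original == replacement:
        return False  # not a substitution at all
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)
```

For instance, an aspartic acid to glutamic acid change is reported as conservative, while aspartic acid to lysine is not.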

In some instances, it may be particularly advantageous or convenient to substitute,
delete and/or add amino acid residues to a sulfenyl amide protein tyrosine
phosphatase 1B binding pocket or catalytic residue in order to provide convenient
cloning sites in the cDNA encoding the polypeptide, to aid in purification of the
polypeptide, to modify compound binding etc. Such substitutions, deletions and/or
additions which do not substantially alter the three dimensional structure of sulfenyl
amide protein tyrosine phosphatase 1B will be apparent to those having skills in the
art.

It should be noted that the mutants contemplated herein need not exhibit enzymatic
activity. Indeed, amino acid substitutions, additions or deletions that interfere with
the catalytic activity of the protein tyrosine phosphatase 1B but which do not
significantly alter the three-dimensional structure of the catalytic region are
specifically contemplated by the invention. Such crystalline polypeptides, or the
atomic structure co-ordinates obtained therefrom, can be used to identify
compounds that bind to the protein.

The crystallization of such mutants and the determination of the three-dimensional
structures by X-ray crystallography rely on the ability of the mutant proteins to
yield crystals that diffract at high resolution. The mutant protein could then be used
to obtain information on compound binding through the determination of mutant
protein/ligand complex structures, which may be characterized using the sulfenyl
amide protein tyrosine phosphatase 1B crystal structure of Table 1 or Table 2.

The mutations can be introduced by site-directed mutagenesis e.g. using a
Stratagene QuikChange™ Site-Directed Mutagenesis Kit or cassette mutagenesis
methods (see e.g. Ausubel et al., eds., Current Protocols in Molecular Biology,
John Wiley & Sons, Inc., New York, and Sambrook et al., Molecular Cloning: a
Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY, (1989)).

To the extent that the present invention relates to sulfenyl amide protein tyrosine
phosphatase 1B-ligand complexes and mutant and homologue proteins of sulfenyl
amide protein tyrosine phosphatase 1B, crystals of such proteins may be formed.
The skilled person would recognize that the conditions provided herein for
crystallizing sulfenyl amide protein tyrosine phosphatase 1B may be used to form
such crystals. Alternatively, the skilled person would use the conditions as a basis
for identifying modified conditions for forming the crystals.

Thus the aspects of the invention relating to crystals of sulfenyl amide protein
tyrosine phosphatase 1B, may be extended to crystals of a mutant, analogue or
homologue, or mutein.

Homology Modelling

The invention also provides a means for homology modelling of other proteins
(referred to below as target sulfenyl amide protein tyrosine phosphatase proteins).

By "homology modelling", it is meant the prediction of related sulfenyl amide
protein tyrosine phosphatase structures based either on X-ray crystallographic data
or computer-assisted de novo prediction of structure, based upon manipulation of
the coordinate data of Table 1 or Table 2.

"Homology modelling" extends to target sulfenyl amide protein tyrosine
phosphatase proteins, which are analogues or homologues of the sulfenyl amide
protein tyrosine phosphatase 1B protein whose structure has been determined in the
accompanying examples.

The term "homologous regions" describes amino acid residues in two sequences
that are identical or have similar (e.g. aliphatic, aromatic, polar, negatively charged,
or positively charged) side-chain chemical groups. Identical and similar residues in
homologous regions are sometimes described as being respectively "invariant" and
"conserved" by those skilled in the art.

In general, the method involves comparing the amino acid sequences of the sulfenyl
amide protein tyrosine phosphatase 1B protein of Table 1 or Table 2 with a target
sulfenyl amide protein tyrosine phosphatase protein by aligning the amino acid
sequences (Dunbrack et al., Folding and Design, 2, (1997), 27-42). Amino acids in
the sequences are then compared and groups of amino acids that are homologous
(conveniently referred to as "corresponding regions") are grouped together. This
method detects conserved regions of the polypeptides and accounts for amino acid
insertions or deletions.

Homology between amino acid sequences can be determined using commercially
available algorithms. The programs BLAST, GAPPED BLAST, BLASTN,
PSI-BLAST and BLAST 2 sequences (provided by the National Center for
Biotechnology Information) are widely used in the art for this purpose, and can
align homologous regions of two amino acid sequences. These may be used with
default parameters to determine the degree of homology between the amino acid
sequence of the Table 1 or Table 2 protein and other target sulfenyl amide protein
tyrosine phosphatase proteins, which are to be modeled.

Analogues are defined as proteins with similar three-dimensional structures and/or
functions with little evidence of a common ancestor at a sequence level.

Homologues are defined, as previously, as proteins with evidence of a common
ancestor, i.e. likely to be the result of evolutionary divergence, and are divided into
remote, medium and close sub-divisions based on the degree (usually expressed as
a percentage) of sequence identity.

A homologue is defined here as a protein with at least 15% sequence identity or
which has at least one functional domain, which is characteristic of sulfenyl amide
protein tyrosine phosphatase 1B.
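The sequence identity criterion can be illustrated with a minimal sketch. It assumes the two sequences are already aligned as gapped strings of equal length and that identity is counted over gap-free columns only; other denominators are in common use and the text does not specify one. The alternative criterion (a characteristic functional domain) is not covered, and the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of identical residues over columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip gapped columns (an assumption of this sketch)
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_a, aligned_b, threshold=15.0):
    """Apply the 'at least 15% sequence identity' criterion from the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Because the result depends on how the alignment was produced and on the chosen denominator, values near the threshold should be interpreted with care.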

There are two types of homologue: orthologues and paralogues. Orthologues are
defined as homologous genes in different organisms, i.e. the genes share a common
ancestor coincident with the speciation event that generated them. Paralogues are
defined as homologous genes in the same organism derived from a
gene/chromosome/genome duplication, i.e. the common ancestor of the genes
occurred since the last speciation event.

The homologues could also be mutants as described above. 

Once the amino acid sequences of the polypeptides with known and unknown
structures are aligned, the structures of the conserved amino acids in a computer
representation of the polypeptide with known structure are transferred to the
corresponding amino acids of the polypeptide whose structure is unknown. For
example, a tyrosine in the amino acid sequence of known structure may be replaced
by a phenylalanine, the corresponding homologous amino acid in the amino acid
sequence of unknown structure.

The structures of amino acids located in non-conserved regions may be assigned
manually by using standard peptide geometries or by molecular simulation
techniques, such as molecular dynamics. The final step in the process is
accomplished by refining the entire structure using molecular dynamics and/or
energy minimization.

Homology modeling as such is a technique that is well known to those skilled in the
art (see e.g. Greer, Science, vol. 228, (1985), 1055, and Blundell et al., Eur. J.
Biochem, vol. 172, (1988), 513). The techniques described in these references, as
well as other homology modeling techniques, generally available in the art, may be
used in performing the present invention.

Thus the invention provides a method of homology modeling comprising the steps
of: (a) aligning a representation of an amino acid sequence of a target sulfenyl
amide protein tyrosine phosphatase protein of unknown three-dimensional structure
with the amino acid sequence of the sulfenyl amide protein tyrosine phosphatase 1B
of Table 1 or Table 2 to match homologous regions of the amino acid sequences;
(b) modeling the structure of the matched homologous regions of said target
sulfenyl amide protein tyrosine phosphatase of unknown structure on the
corresponding regions of the sulfenyl amide protein tyrosine phosphatase 1B
structure as defined by Table 1 or Table 2; and (c) determining a conformation (e.g.
so that favorable interactions are formed within the target sulfenyl amide protein
tyrosine phosphatase of unknown structure and/or so that a low energy
conformation is formed) for said target sulfenyl amide protein tyrosine phosphatase
of unknown structure which substantially preserves the structure of said matched
homologous regions.

Preferably one or all of steps (a) to (c) are performed by computer modeling. 

The aspects of the invention described herein which utilize the sulfenyl amide
protein tyrosine phosphatase 1B structure in silico may be equally applied to
homologue models of sulfenyl amide protein tyrosine phosphatase obtained by the
above aspect of the invention, and this application forms a further aspect of the
present invention. Thus having determined a conformation of a sulfenyl amide
protein tyrosine phosphatase by the method described above, such a conformation
may be used in a computer-based method of rational drug design as described
herein.

In a preferred aspect of this invention the co-ordinates are used to model the
structure of target sulfenyl amide protein tyrosine phosphatases, particularly
homologues of sulfenyl amide protein tyrosine phosphatase 1B, for example PTP-α,
T-cell PTP, or LAR.

Structure solution 

The structure of the human sulfenyl amide protein tyrosine phosphatase 1B can also
be used to solve the crystal structure of other target sulfenyl amide protein tyrosine
phosphatase proteins including other crystal forms of sulfenyl amide protein
tyrosine phosphatase 1B, mutants, and co-complexes of sulfenyl amide protein
tyrosine phosphatase 1B, where X-ray diffraction data or NMR spectroscopic data
of these target sulfenyl amide protein tyrosine phosphatase proteins have been
generated and require interpretation in order to provide a structure.

In the case of sulfenyl amide protein tyrosine phosphatase 1B, this protein may
crystallize in more than one crystal form. The structure coordinates of sulfenyl
amide protein tyrosine phosphatase 1B, or portions thereof, as provided by this
invention are particularly useful to solve the structure of those other crystal forms
of sulfenyl amide protein tyrosine phosphatase 1B. They may also be used to solve
the structure of sulfenyl amide protein tyrosine phosphatase 1B mutants, sulfenyl
amide protein tyrosine phosphatase 1B co-complexes, or the structure of the
crystalline form of any other protein with significant amino acid sequence
homology to sulfenyl amide protein tyrosine phosphatase 1B.

In the case of other target sulfenyl amide protein tyrosine phosphatase proteins,
particularly the mutant sulfenyl amide protein tyrosine phosphatase proteins
referred to above, the present invention allows the structures of such targets to be
obtained more readily where raw X-ray diffraction data are generated.

Thus, where X-ray crystallographic or NMR spectroscopic data are provided for a
target sulfenyl amide protein tyrosine phosphatase 1B-ligand complex, or a sulfenyl
amide protein tyrosine phosphatase 1B homologue or analogue of unknown
three-dimensional structure, the structure of sulfenyl amide protein tyrosine
phosphatase 1B, as defined by Table 1 or Table 2, may be used to interpret the data
to provide a likely structure for the other sulfenyl amide protein tyrosine
phosphatases by techniques which are well known in the art, e.g. phasing in the case
of X-ray crystallography and assisting peak assignments in NMR spectra.

One method that may be employed for these purposes is molecular replacement. In
this method, the unknown crystal structure, whether it is another crystal form of
sulfenyl amide protein tyrosine phosphatase 1B, a sulfenyl amide protein tyrosine
phosphatase 1B mutant, or a sulfenyl amide protein tyrosine phosphatase 1B
co-complex, or the crystal of a target sulfenyl amide protein tyrosine phosphatase
protein with amino acid sequence homology to protein tyrosine phosphatase 1B,
may be determined using the sulfenyl amide protein tyrosine phosphatase 1B
structure coordinates of this invention as provided herein. This method will provide
an accurate structural form for the unknown crystal more quickly and efficiently
than attempting to determine such information ab initio.

Examples of computer programs known in the art for performing molecular
replacement are CNX (Brunger A.T.; Adams P.D.; Rice L.M., Current Opinion in
Structural Biology, Volume 8, Issue 5, October 1998, Pages 606-611; also
commercially available from Accelrys, San Diego, CA) or AMoRe (Navaza, J.
(1994). AMoRe: An Automated Package for Molecular Replacement. Acta Cryst.
A50, 157-163).

Thus, in a further aspect, the invention provides a method for determining the
structure of a protein, which method comprises providing the co-ordinates of Table
1 or Table 2, and either (a) positioning the co-ordinates in the crystal unit cell of
said protein so as to provide a structure for said protein or (b) assigning NMR
spectra peaks of said protein by manipulating the coordinates of Table 1 or Table 2.

In a preferred aspect of this invention the co-ordinates are used to solve the
structure of target sulfenyl amide protein tyrosine phosphatase, particularly
homologues of sulfenyl amide protein tyrosine phosphatase 1B, for example PTP-α,
T-cell PTP, or LAR.

Computer systems

In another aspect, the present invention provides systems, particularly a computer
system, the systems containing either (a) atomic coordinate data according to Table
1 or Table 2, said data defining the three-dimensional structure of sulfenyl amide
protein tyrosine phosphatase 1B or at least selected coordinates thereof; (b)
structure factor data (where a structure factor comprises the amplitude and phase of
the diffracted wave) for sulfenyl amide protein tyrosine phosphatase 1B, said
structure factor data being derivable from the atomic coordinate data of Table 1 or
Table 2; (c) atomic coordinate data of a target sulfenyl amide protein tyrosine
phosphatase protein generated by homology modeling of the target based on the
data of Table 1 or Table 2; (d) atomic coordinate data of a target sulfenyl amide
protein tyrosine phosphatase protein generated by interpreting X-ray
crystallographic data or NMR data by reference to the data of Table 1 or Table 2; or
(e) structure factor data derivable from the atomic coordinate data of (c) or (d).
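
As a hedged illustration of how structure factor data may be derived from atomic coordinate data, the sketch below evaluates the standard summation F(hkl) = Σ f_j exp[2πi(hx_j + ky_j + lz_j)] and reports the amplitude and phase of the result. It rests on stated simplifications that are assumptions of this example: fractional coordinates, a constant scattering factor per atom, no temperature factors and no space-group symmetry. The two-atom cell is hypothetical and is not taken from Table 1 or Table 2.

```python
import cmath
import math


def structure_factor(hkl, atoms):
    """Return the complex structure factor F(hkl).

    atoms: iterable of (f, x, y, z), where f is a constant scattering factor
    (a simplification) and x, y, z are fractional coordinates in the unit cell.
    """
    h, k, l = hkl
    total = 0j
    for f, x, y, z in atoms:
        total += f * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
    return total


def amplitude_and_phase(F):
    """Split a structure factor into amplitude and phase (in degrees)."""
    return abs(F), math.degrees(cmath.phase(F))


if __name__ == "__main__":
    # Hypothetical two-atom cell: identical atoms at the origin and body centre.
    cell = [(6.0, 0.0, 0.0, 0.0), (6.0, 0.5, 0.5, 0.5)]
    for hkl in [(1, 0, 0), (1, 1, 0), (2, 0, 0)]:
        amp, phase = amplitude_and_phase(structure_factor(hkl, cell))
        print(hkl, round(amp, 3), round(phase, 1))
```

In this toy cell, reflections with h + k + l odd cancel to zero amplitude, which is a convenient check that the summation behaves as expected.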

For example the computer system may comprise: (i) a computer-readable data
storage medium comprising data storage material encoded with the computer-
readable data; (ii) a working memory for storing instructions for processing said
computer-readable data; and (iii) a central-processing unit coupled to said working
memory and to said computer-readable data storage medium for processing said
computer-readable data and thereby generating structures and/or performing
rational drug design. The computer system may further comprise a display coupled
to said central-processing unit for displaying said structures.

The invention also provides such systems containing atomic coordinate data of
target sulfenyl amide protein tyrosine phosphatase proteins wherein such data have
been generated according to the methods of the invention described herein based on
the starting data provided by Table 1 or Table 2.

Such data are useful for a number of purposes, including the generation of
structures to analyze the mechanisms of action of sulfenyl amide protein tyrosine
phosphatase 1B proteins and/or to perform rational drug design of compounds
which interact with sulfenyl amide protein tyrosine phosphatase 1B, such as
compounds which are inhibitors of sulfenyl amide protein tyrosine phosphatase 1B.

In another aspect, the invention provides a computer-readable storage medium,
comprising a data storage material encoded with computer readable data, wherein
the data are defined by all or a portion (e.g. selected coordinates as defined herein)
of the structure coordinates of sulfenyl amide protein tyrosine phosphatase 1B of
Table 1 or Table 2, or a homologue of sulfenyl amide protein tyrosine phosphatase
1B, wherein said homologue comprises backbone atoms that have a root mean
square deviation from the backbone atoms (nitrogen-Cα-carbon) of Table 1 or
Table 2 of not more than 1.5 Å.
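
The root mean square deviation criterion above can be illustrated with a short sketch. It assumes, for simplicity, that the two sets of backbone atoms have already been paired one-to-one and optimally superposed; the superposition step itself is not shown. The function names and the toy coordinates are hypothetical.

```python
import math


def backbone_rmsd(coords_a, coords_b):
    """RMSD in Å between two equally sized, pre-superposed lists of (x, y, z)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(coords_a))


def within_cutoff(coords_a, coords_b, cutoff=1.5):
    """True if the backbone RMSD does not exceed the cutoff (default 1.5 Å)."""
    return backbone_rmsd(coords_a, coords_b) <= cutoff


if __name__ == "__main__":
    # Hypothetical backbone atoms (N, CA, C), not taken from Table 1 or Table 2.
    reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (2.0, 1.4, 0.0)]
    homologue = [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0), (2.0, 1.4, 1.0)]
    print(backbone_rmsd(reference, homologue), within_cutoff(reference, homologue))
```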

The invention also provides a computer-readable data storage medium comprising a
data storage material encoded with a first set of computer-readable data comprising
a Fourier Transform of at least a portion (e.g. selected coordinates as defined
herein) of the structural coordinates for sulfenyl amide protein tyrosine phosphatase
1B according to Table 1 or Table 2; which, when combined with a second set of
machine readable data comprising an X-ray diffraction pattern of a molecule or
molecular complex of unknown structure, using a machine programmed with the
instructions for using said first set of data and said second set of data, can determine
at least a portion of the structure coordinates corresponding to the second set of
machine readable data.

In a further aspect, the present invention provides computer readable media with at
least one of: (a) atomic coordinate data according to Table 1 or Table 2 recorded
thereon, said data defining the three-dimensional structure of sulfenyl amide protein
tyrosine phosphatase 1B, or at least selected coordinates thereof; (b) structure factor
data for sulfenyl amide protein tyrosine phosphatase 1B recorded thereon, the
structure factor data being derivable from the atomic coordinate data of Table 1 or
Table 2; (c) atomic coordinate data of a target sulfenyl amide protein tyrosine
phosphatase protein generated by homology modeling of the target based on the
data of Table 1 or Table 2; (d) atomic coordinate data of a sulfenyl amide protein
tyrosine phosphatase 1B-ligand complex or a sulfenyl amide protein tyrosine
phosphatase 1B homologue or analogue generated by interpreting X-ray
crystallographic data or NMR data by reference to the data of Table 1 or Table 2;
and (e) structure factor data derivable from the atomic coordinate data of (c) or (d).

By providing such computer readable media, the atomic coordinate data can be
routinely accessed to model sulfenyl amide protein tyrosine phosphatase 1B or
selected coordinates thereof. For example, RasMol (Sayle et al., TIBS, vol. 20,
(1995), 374) is a publicly available computer software package which allows access
and analysis of atomic coordinate data for structure determination and/or rational
drug design.

On the other hand, structure factor data, which are derivable from atomic coordinate
data (see e.g. Blundell et al., in Protein Crystallography, Academic Press, New
York, London and San Francisco, (1976)), are particularly useful for calculating
e.g. difference Fourier electron density maps.

A further aspect of the invention provides a method of providing data for generating
structures and/or performing rational drug design for sulfenyl amide protein
tyrosine phosphatase 1B, sulfenyl amide protein tyrosine phosphatase 1B
homologues or analogues, complexes of sulfenyl amide protein tyrosine
phosphatase 1B with a candidate modulator, or complexes of sulfenyl amide protein
tyrosine phosphatase 1B homologues or analogues with candidate modulators, the
method comprising:

(i) establishing communication with a remote device containing computer-readable
data comprising at least one of: (a) atomic coordinate data according to Table 1 or
Table 2, said data defining the three-dimensional structure of sulfenyl amide protein
tyrosine phosphatase 1B, at least one sub-domain of the three-dimensional structure
of sulfenyl amide protein tyrosine phosphatase 1B, or the coordinates of a portion
of atoms of sulfenyl amide protein tyrosine phosphatase 1B; (b) structure factor
data for sulfenyl amide protein tyrosine phosphatase 1B, said structure factor data
being derivable from the atomic coordinate data of Table 1 or Table 2; (c) atomic
coordinate data of a target sulfenyl amide protein tyrosine phosphatase 1B
homologue or analogue generated by homology modeling of the target based on the
data of Table 1 or Table 2; (d) atomic coordinate data of a protein generated by
interpreting X-ray crystallographic data or NMR data by reference to the data of
Table 1 or Table 2; and (e) structure factor data derivable from the atomic
coordinate data of (c) or (d); and

(ii) receiving said computer-readable data from said remote device.

Thus the remote device may comprise e.g. a computer system or computer readable
media of one of the previous aspects of the invention. The device may be in a
different country or jurisdiction from where the computer-readable data are received.
The communication may be via the internet, intranet, e-mail etc. Typically the
communication will be electronic in nature, but some or all of the communication
pathway may be optical, for example, over optical fibre transmission lines.

Drug Discovery

Determination of the 3D structure of PTP1B provides important information about
the nature of the changes to the active site of PTPs upon oxidation, in particular the
changes in PTP1B upon oxidation to the PTP1B sulfenyl amide. In particular, the
X-ray data provide information about new binding sites created by distortion of the
active site as a consequence of the formation of the sulfenyl amide. Information
about the new binding sites can then be used for rational design of compounds that
bind to PTP sulfenyl amide, especially PTP1B sulfenyl amide. This can be achieved
by e.g. computational techniques which identify possible binding ligands for the
active sites, by enabling linked-fragment approaches to drug design, and by
enabling the identification and location of bound ligands using X-ray
crystallographic analysis. These techniques are discussed in more detail below.

In silico Analysis 

The provision of the crystal structure of PTP1B sulfenyl amide allows a novel
approach for drug discovery for modulators of this inactive form of PTP and in
particular of PTP1B. Accordingly, the invention provides a computer-based
method of rational drug design which comprises:

providing the structure of the PTP1B sulfenyl amide as defined by the
coordinates of Table 1 or Table 2;

providing the structure of a candidate modulator molecule; and

fitting the structure of the candidate to the structure of the sulfenyl amide of
Table 1 or Table 2.

More particularly, the crystal structure of the sulfenyl amide can be used to design
drug molecules that bind to the sulfenyl amide of PTP1B to inhibit or prevent its
conversion to the active form of PTP1B and hence another aspect of the invention
comprises a computer-based method of rational drug design which comprises:

providing the structure of the PTP1B sulfenyl amide as defined by the
coordinates of Table 1 or Table 2;

providing the structure of a candidate compound; and

fitting the structure of the candidate compound to the structure of the
sulfenyl amide as defined by the coordinates of Table 1 or Table 2.

The invention further provides a method of identifying by rational drug design a
compound capable of reducing the level of activity of a protein tyrosine
phosphatase (PTP) in a cellular environment, the PTP being one which is
convertible in a cellular environment between an active form and an inactive or less
active form, the inactive form or less active form being characterised by the
presence of a sulfenyl amide moiety formed at the active site of the PTP between
the sulphur atom of a cysteine group and a backbone nitrogen atom of a
neighbouring amino acid;

which method comprises:

(a) designing a ligand that will (i) bind to the active site in the region of the
sulfenyl amide moiety to inhibit conversion of the inactive form or less active form
back to the active form, or (ii) modify the sulfenyl amide moiety to inhibit
conversion of the inactive form or less active form of the PTP to the active form;

(b) synthesizing the ligand; and

(c) determining whether the ligand reduces the level of activity of a protein
tyrosine phosphatase (PTP) in a cellular environment.

In an alternative aspect, the method of the invention may utilise the coordinates of
atoms of interest of the PTP1B which are in the vicinity of a putative binding region
in order to model the pocket in which a ligand will bind. These coordinates
may be used to define a space which is then screened "in silico" against a candidate
modulator molecule.

Thus the invention provides a computer-based method of rational drug design
which comprises:

providing the coordinates of at least two atoms of Table 1 or Table 2 of the
PTP1B sulfenyl amide ("selected coordinates");

providing the structure of a candidate modulator molecule; and

fitting the structure of the candidate to the selected coordinates of the PTP1B
sulfenyl amide.
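
One simple way of arriving at "selected coordinates" for a putative binding region is to keep only those atoms lying within a chosen distance of a point of interest. The sketch below is illustrative only: the function name, the radius, and the atom names and coordinates are hypothetical and are not taken from Table 1 or Table 2. It uses `math.dist`, which is available in Python 3.8 and later.

```python
import math


def select_pocket_atoms(atoms, center, radius):
    """Return the atoms lying within `radius` Å of `center`.

    atoms: list of (name, x, y, z) tuples; center: (x, y, z).
    """
    return [atom for atom in atoms if math.dist(atom[1:], center) <= radius]


if __name__ == "__main__":
    # Hypothetical atoms and coordinates, for illustration only.
    atoms = [
        ("SG", 0.0, 0.0, 0.0),
        ("N", 1.7, 0.0, 0.0),
        ("CA", 2.5, 1.2, 0.0),
        ("CB", 12.0, 3.0, 1.0),
    ]
    pocket = select_pocket_atoms(atoms, center=(0.0, 0.0, 0.0), radius=5.0)
    print([name for name, *_ in pocket])
```

The resulting subset can then stand in for the full structure when a candidate molecule is fitted to the pocket.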

In practice, it will be desirable to model a sufficient number of atoms of the PTP1B
sulfenyl amide as defined by the coordinates of Table 1 or Table 2 which represent
a binding pocket. Binding pockets and other features of the interaction of PTP1B
sulfenyl amide with a putative compound of the invention are described below.
Thus, in this embodiment of the invention, there will preferably be provided the
coordinates of at least 5, preferably at least 10, more preferably at least 50 and even
more preferably at least 100 atoms of the PTP sulfenyl amide structure.

By "fitting", it is meant determining by automatic, or semi-automatic means,
interactions between at least one atom of the candidate and at least one atom of the
PTP1B sulfenyl amide, and calculating the extent to which such an interaction is
stable. Interactions include attraction and repulsion, brought about by charge, steric
considerations and the like. Various computer-based methods for fitting are
described further herein.
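
The notion of calculating the extent to which an interaction is stable can be illustrated, in a deliberately simplified way, by summing a steric term and a charge term over all candidate-protein atom pairs. The sketch below uses a Lennard-Jones 12-6 potential and a Coulomb term; the parameter values (epsilon, sigma, dielectric), the function names and the coordinates are assumptions made for this example. It is not the scoring function of any particular docking program.

```python
import math

COULOMB_K = 332.06  # kcal·Å/(mol·e²), conventional conversion factor


def pair_energy(r, q1, q2, epsilon=0.1, sigma=3.5, dielectric=4.0):
    """Steric (Lennard-Jones 12-6) plus electrostatic energy for one atom pair."""
    r = max(r, 0.1)  # avoid division by zero for overlapping atoms
    steric = 4.0 * epsilon * ((sigma / r) ** 12 - (sigma / r) ** 6)
    electrostatic = COULOMB_K * q1 * q2 / (dielectric * r)
    return steric + electrostatic


def fit_score(candidate_atoms, protein_atoms):
    """Sum pair energies over all candidate-protein atom pairs.

    Each atom is (x, y, z, partial_charge). Lower scores indicate a more
    favourable (more stable) fit under this toy model.
    """
    total = 0.0
    for cx, cy, cz, cq in candidate_atoms:
        for px, py, pz, pq in protein_atoms:
            r = math.dist((cx, cy, cz), (px, py, pz))
            total += pair_energy(r, cq, pq)
    return total


if __name__ == "__main__":
    # Hypothetical coordinates and charges, for illustration only.
    protein = [(0.0, 0.0, 0.0, -0.5), (3.8, 0.0, 0.0, 0.3)]
    good_pose = [(0.0, 4.0, 0.0, 0.5)]
    clashing_pose = [(0.0, 1.5, 0.0, 0.5)]
    print(fit_score(good_pose, protein), fit_score(clashing_pose, protein))
```

Under this model a pose that overlaps a protein atom is penalised heavily by the repulsive term, while complementary charges at a reasonable separation lower the score.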

By "binding site" we mean a site (such as an atom, a functional group of an amino
acid residue or a plurality of such atoms and/or groups) in a PTP1B sulfenyl amide
binding cavity which may bind to a candidate ligand. Depending on the particular
molecule in the cavity, sites may exhibit attractive or repulsive binding interactions,
brought about by charge, steric considerations and the like.

As a result of the determination of the PTP1B sulfenyl amide 3D structure, more
purely computational techniques for rational drug design may also be used to design
PTP sulfenyl amide ligands (for an overview of these techniques see e.g. Walters et
al., Drug Discovery Today, Vol. 3, No. 4, (1998), 160-178; Abagyan, R.; Totrov, M.
Curr. Opin. Chem. Biol. 2001, 5, 375-382). For example, automated ligand-
receptor docking programs (discussed e.g. by Jones et al. in Current Opinion in
Biotechnology, Vol. 6, (1995), 652-656 and Halperin, I.; Ma, B.; Wolfson, H.;
Nussinov, R. Proteins 2002, 47, 409-443) which require accurate information on
the atomic coordinates of target receptors may be used to design potential PTP1B
sulfenyl amide ligands.

The step of providing the structure of a candidate ligand molecule may involve
selecting the compound by computationally screening a database of compounds for
interaction with the active site. For example, a 3-D descriptor for the candidate
modulator may be derived, the descriptor including geometric and functional
constraints derived from the architecture and chemical nature of the binding site.
The descriptor may then be used to interrogate the compound database, a candidate
ligand being a compound that has a good match to the features of the descriptor. In
effect, the descriptor is a type of virtual pharmacophore.

In any event, the determination of the three-dimensional structure of PTP1B
sulfenyl amide provides a basis for the design of new and specific ligands for
PTP1B sulfenyl amide. For example, knowing the three-dimensional structure of
PTP1B sulfenyl amides, computer modelling programs may be used to design
different molecules expected to interact with possible or confirmed active sites,
such as binding sites or other structural or functional features of PTP1B sulfenyl
amide.

More specifically, a candidate ligand for PTP1B sulfenyl amide can be examined
through the use of computer modelling using a docking program such as GOLD
(Jones et al., J. Mol. Biol., 245, 43-53 (1995), Jones et al., J. Mol. Biol., 267,
727-748 (1997)), GRAMM (Vakser, I.A., Proteins, Suppl., 1:226-230 (1997)), DOCK
(Kuntz et al., J. Mol. Biol. 1982, 161, 269-288, Makino et al., J. Comput. Chem. 1997,
18, 1812-1825), AUTODOCK (Goodsell et al., Proteins 1990, 8, 195-202, Morris et
al., J. Comput. Chem. 1998, 19, 1639-1662), FlexX (Rarey et al., J. Mol. Biol. 1996,
261, 470-489) or ICM (Abagyan et al., J. Comput. Chem. 1994, 15, 488-506). This
procedure can include computer fitting of candidate ligands to PTP1B sulfenyl
amide to ascertain how well the shape and the chemical structure of the candidate
ligand will bind to the enzyme.

Computer-assisted, manual examination of the active site structure of PTP1B
may also be performed. Programs such as GRID (Goodford, J. Med. Chem.,
28, (1985), 849-857), which determines probable interaction sites between
molecules with various functional groups and the enzyme surface, may also be
used to analyse the active site to predict partial structures of ligands.

Computer programs can be employed to estimate the attraction, repulsion, and
steric hindrance of the two binding partners (e.g. the PTP1B sulfenyl amide and a
candidate ligand). Generally the tighter the fit, the fewer the steric hindrances, and
the greater the attractive forces, the more potent the candidate ligand since these
properties are consistent with a tighter binding constant. Furthermore, the more
specificity in the design of a candidate ligand, the more likely it is that it will not
interact with other proteins as well. This will tend to minimise potential side-
effects due to unwanted interactions with other proteins.

Linked fragment and fragment growing approaches. 

If more than one PTP1B sulfenyl amide binding site is characterised and a plurality
of respective compounds are designed or selected, the candidate ligand may be
formed by linking the respective compounds into a larger compound which
maintains the relative positions and orientations of the respective compounds at the
active sites. The larger compound may be formed as a real molecule or by
computer modelling.

Linked-fragment approaches to drug design also require accurate information on the
atomic coordinates of target receptors. Small compounds which have the potential
to bind to regions of PTP1B sulfenyl amide which in themselves may not be
modulator compounds may be assembled by chemical linkage to provide candidate
modulators. Thus the basic idea behind these approaches is to determine the
binding locations of plural ligands to a target molecule, and then construct a
molecular scaffold to connect the ligands together in such a way that their relative
binding positions are preserved. The ligands may be provided computationally and
modelled in a computer system, or provided in an experimental setting, wherein
crystals according to the invention are provided and a plurality of ligands soaked
separately or in mixed pools into the crystal prior to X-ray analysis and
determination of their location.

For example, the binding of one or more molecular fragments can be determined in
the protein binding cavity by X-ray crystallography. Molecular fragments are
typically compounds with a molecular weight between 100 and 200 Da. This can
then provide a starting point for medicinal chemistry to optimize the interactions
using a structure-based approach. The fragments can be combined onto a template
or used as the starting point for "growing out" a modulator into other cavities of the
protein. The fragments can be positioned in the binding cavity or cavities of
PTP1B sulfenyl amide and then 'grown' to fill the space available, exploring the
electrostatic, van der Waals or hydrogen-bonding interactions that are involved in
molecular recognition. The potency of the original weakly binding fragment thus
can be rapidly improved using iterative structure-based chemical synthesis.
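
As a small worked illustration of the molecular weight range mentioned above, the sketch below computes a molecular weight from a simple elemental formula and checks whether it falls between 100 and 200 Da. The function names are hypothetical, the table of atomic masses is deliberately limited to a few common elements, and the parser assumes plain formulas without brackets or charges.

```python
import re

# Standard atomic masses (Da) for a few common elements; not exhaustive.
ATOMIC_MASS = {
    "H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999,
    "F": 18.998, "P": 30.974, "S": 32.06, "Cl": 35.45, "Br": 79.904,
}


def molecular_weight(formula):
    """Molecular weight of a plain formula such as 'C7H6O2' (no brackets)."""
    total = 0.0
    for symbol, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_MASS[symbol] * (int(count) if count else 1)
    return total


def is_fragment_sized(formula, low=100.0, high=200.0):
    """True if the molecular weight lies within the stated fragment range."""
    return low <= molecular_weight(formula) <= high


if __name__ == "__main__":
    for name, formula in [("benzoic acid", "C7H6O2"), ("benzene", "C6H6")]:
        print(name, round(molecular_weight(formula), 3), is_fragment_sized(formula))
```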

At one or more stages in the fragment growing approach, the compound may be
synthesized and tested in a biological system for its activity. This can be used to
guide the further growing out of the fragment.

Where two fragment-binding regions are identified, a linked fragment approach 
may be based upon attempting to link the two fragments directly, or growing one or 
both fragments in the manner described above in order to obtain a larger, linked 
structure which may have the desired properties. 

Thus the binding sites of two or more ligands are determined and the ligands may be
connected to form a potential lead compound that can be further refined using e.g.
the iterative technique of Greer et al. For a virtual linked-fragment approach see
Verlinde et al., J. of Computer-Aided Molecular Design, 6, (1992), 131-147, and for
NMR and X-ray approaches see Shuker et al., Science, 274, (1996), 1531-1534 and
Stout et al., Structure, 6, (1998), 839-848. The use of these approaches to design
PTP1B sulfenyl amide modulators is made possible by the determination of the
PTP1B sulfenyl amide structure.

Generation and Analysis of ligand-PTP1B sulfenyl amide complexes

In a further aspect, the invention provides a method for determining the structure of
a compound bound to sulfenyl amide PTP1B. The methods above may comprise the
further steps of:

obtaining or synthesising a candidate modulator;

forming a complex of PTP1B sulfenyl amide and said candidate modulator;
and

analysing said complex by X-ray crystallography to determine the ability of
said candidate modulator to interact with PTP1B sulfenyl amide.

The invention also provides a method for determining the structure of a compound
bound to sulfenyl amide PTP1B, said method comprising: (a) providing a crystal of
sulfenyl amide PTP1B according to the invention; (b) soaking the crystal with said
compound; and (c) determining the structure of said sulfenyl amide PTP1B-
compound complex by employing the data of Table 1 or Table 2.

Alternatively, the sulfenyl amide PTP1B and compound may be co-crystallized.
Thus the invention provides a method for determining the structure of a compound
bound to sulfenyl amide PTP1B, said method comprising: mixing the protein with
the compound(s); crystallizing the protein-compound(s) complex; and determining
the structure of said sulfenyl amide PTP1B-compound(s) complex by reference to
the data of Table 1 or Table 2.

A mixture of compounds may be soaked or co-crystallized with the crystal, wherein
only one or some of the compounds may be expected to bind to the sulfenyl amide
PTP1B. As well as the structure of the complex, the identity of the complexing
compound(s) is/are then determined.

In either case, substrate or a substrate analogue thereof may optionally be present.

The method may comprise the further steps of: (a) obtaining or synthesising said
compound; (b) forming a complex of sulfenyl amide PTP1B and said compound;
and (c) analysing said complex by X-ray crystallography or NMR spectroscopy to
determine the ability of said compound to interact with sulfenyl amide PTP1B.

This information may thus be used to design and synthesize novel classes of
sulfenyl amide PTP1B inhibitors.

Detailed structural information can then be obtained about the binding of the
candidate modulator to PTP1B sulfenyl amide, and in the light of this information
adjustments can be made to the structure or functionality of the candidate
modulator, e.g. to improve binding to the active site. The above steps may be
repeated as often as necessary.

In another aspect, the invention provides a method of analysing a complex of
PTP1B sulfenyl amide and a candidate modulator comprising the step of employing
(i) X-ray crystallographic diffraction data from the complex and (ii) a
three-dimensional structure of PTP1B sulfenyl amide, or at least one sub-domain
thereof, to generate a difference Fourier electron density map of the complex, the
three-dimensional structure being defined by atomic coordinate data according to
Table 1 or Table 2.

Therefore, such complexes can be crystallised and analysed using X-ray diffraction
methods, e.g. according to the approach described by Greer et al., J. of Medicinal
Chemistry, Vol. 37, (1994), 1035-1054, and difference Fourier electron density
maps can be calculated based on X-ray diffraction patterns of soaked or co-
crystallised PTP1B sulfenyl amide and the solved structure of uncomplexed PTP1B
sulfenyl amide. These maps can then be used to determine whether and where a
particular candidate modulator binds to PTP1B sulfenyl amide and/or changes the
conformation of PTP1B sulfenyl amide.

Electron density maps can be calculated using programs such as those from the
CCP4 computing package (Collaborative Computational Project 4. The CCP4
Suite: Programs for Protein Crystallography, Acta Crystallographica, D50, (1994),
760-763). For map visualisation and model building, programs such as "O" (Jones
et al., Acta Crystallographica, A47, (1991), 110-119) or "QUANTA" (1994, San Diego,
CA: Molecular Simulations, Jones et al., Acta Crystallographica, A47, (1991), 110-
119) can be used.

Greer et al., mentioned above, describe an iterative approach to ligand design based
on repeated sequences of computer modelling, protein-ligand complex formation
and X-ray analysis. Thus novel thymidylate synthase inhibitor series were designed
de novo by Greer et al., and PTP1B sulfenyl amide inhibitors may also be designed
in this way. More specifically, using e.g. GRID on the solved 3D structure of
PTP1B, a candidate modulator for PTP1B sulfenyl amide may be designed that
complements the functionalities of the PTP1B sulfenyl amide binding site(s). The
candidate modulator compound can then be synthesised, formed into a complex
with PTP1B sulfenyl amide, and the complex then analysed by X-ray
crystallography to identify the actual position of the bound compound.

Determination of the position of the candidate modulator in the complex allows
determination of its interactions with PTP1B sulfenyl amide. This will allow
those of skill in the art to analyse the affinity and specificity of the compound for
PTP1B sulfenyl amide, and to propose modifications to the compound to increase
or decrease either or both of these properties. Thus the structure and/or functional
groups of the compound can then be adjusted, if necessary, in view of the results of
the X-ray analysis, and the synthesis and analysis sequence repeated until an
optimised compound is obtained. Related approaches to structure-based drug
design are also discussed in Bohacek et al., Medicinal Research Reviews, Vol. 16,
(1996), 3-50.

Many of the techniques and approaches to structure-based drug design described
above require X-ray analysis to identify the binding position of a candidate
modulator in a complex with a protein. A common way of doing this is to perform
X-ray crystallography on the complex, produce a difference Fourier electron
density map, and associate a particular pattern of electron density with the
candidate modulator. However, in order to produce the map (as explained e.g. by
Blundell et al. mentioned above) it is necessary to know beforehand the protein 3D
structure (or at least the protein structure factors).
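
The dependence on the known protein structure can be seen from the usual textbook form of a difference Fourier synthesis, given here as an illustrative sketch. The symbols are introduced for this example and do not appear elsewhere in the text: V is the unit cell volume, |F_obs| are the amplitudes measured from the complex, and |F_calc| and φ_calc are the amplitudes and phases calculated from the known protein model.

```latex
\Delta\rho(x, y, z) = \frac{1}{V} \sum_{hkl}
  \bigl( |F_{\mathrm{obs}}(hkl)| - |F_{\mathrm{calc}}(hkl)| \bigr)
  \, e^{\, i \varphi_{\mathrm{calc}}(hkl)} \, e^{-2\pi i (hx + ky + lz)}
```

Because the diffraction experiment yields amplitudes but not phases, the phases in this expression must be supplied by the protein model; positive features in the resulting map then indicate electron density, such as a bound candidate modulator, that the model does not account for.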

Therefore, determination of the PTPIB sulfenyl amide structure also allows 
20 difference Fourier electron density maps of complexes of PTPIB sulfenyl amide 
with a candidate modulator to be produced, which can greatly assist the process of 
rational drug design. 
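
By way of illustration only, the difference Fourier synthesis referred to above is conventionally written in the following textbook form. The symbols are introduced here and are not taken from this specification: V is the unit cell volume, |F_obs| are the structure factor amplitudes measured for the complex, and |F_calc| and phi_calc are the amplitudes and phases calculated from the known protein structure, which is why that structure (or at least its structure factors) must be available beforehand.

```latex
\Delta\rho(x,y,z) \;=\; \frac{1}{V}\sum_{hkl}
  \bigl(\,|F_{\mathrm{obs}}(hkl)| - |F_{\mathrm{calc}}(hkl)|\,\bigr)\,
  e^{\,i\varphi_{\mathrm{calc}}(hkl)}\,
  e^{-2\pi i\,(hx + ky + lz)}
```

Positive features in such a map that are not accounted for by the protein model are the electron density that may be associated with the candidate modulator.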

The approaches to structure-based drug design described above all require initial
identification of possible compounds for interaction with the target bio-molecule (in
this case PTP1B sulfenyl amide). Sometimes these compounds are known e.g.
from the research literature. However, when they are not, or when novel
compounds are wanted, a first stage of the drug design program may involve
computer-based in silico screening of compound databases (such as the Cambridge
Structural Database) with the aim of identifying compounds which interact with the
binding site or sites of the target bio-molecule (see Martin, J. Med. Chem., vol 35,
2145-2154 (1992)). Screening selection criteria may be based on pharmacokinetic
properties such as metabolic stability and toxicity. However, determination of the
PTP1B sulfenyl amide structure allows the architecture and chemical nature of each
PTP1B active site to be identified, which in turn allows the geometric and
functional constraints of a descriptor for the potential inhibitor to be derived. The
descriptor is, therefore, a type of virtual 3-D pharmacophore, which can also be
used as a selection criterion or filter for database screening.
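
The use of such a descriptor as a database filter can be sketched as follows. This is a minimal illustration under stated assumptions and not the screening method of the invention: the feature names, the distance windows and the two candidate entries are hypothetical, and each compound is reduced to a single coordinate per feature type.

```python
import math

# Hypothetical descriptor: pairs of feature types and the allowed distance
# window between them, in angstroms. Values are illustrative only and are
# not derived from Table 1 or Table 2.
DESCRIPTOR = {
    ("acceptor", "donor"): (4.0, 6.0),
    ("acceptor", "hydrophobe"): (5.0, 8.0),
    ("donor", "hydrophobe"): (3.0, 7.0),
}


def matches_descriptor(features, descriptor=DESCRIPTOR):
    """Return True if every feature pair is present and within its window.

    `features` maps a feature type to one (x, y, z) coordinate.
    """
    for (first, second), (low, high) in descriptor.items():
        if first not in features or second not in features:
            return False
        distance = math.dist(features[first], features[second])
        if not low <= distance <= high:
            return False
    return True


def screen(database, descriptor=DESCRIPTOR):
    """Keep the names of database entries that satisfy the descriptor."""
    return [name for name, feats in database.items()
            if matches_descriptor(feats, descriptor)]


# Hypothetical database entries (coordinates in angstroms).
candidates = {
    "cmpd_A": {"acceptor": (0.0, 0.0, 0.0), "donor": (5.0, 0.0, 0.0),
               "hydrophobe": (5.0, 4.0, 0.0)},
    "cmpd_B": {"acceptor": (0.0, 0.0, 0.0), "donor": (9.0, 0.0, 0.0),
               "hydrophobe": (5.0, 4.0, 0.0)},
}

hits = screen(candidates)  # cmpd_B fails the acceptor-donor window
```

In practice the geometric and functional constraints would be derived from the binding sites of the solved structure rather than chosen by hand as here.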

The Crystal Structure of the Catalytic Domain of PTP1B Sulfenyl Amide

The structure of the portion of the PTP sulfenyl amide corresponding to the
catalytic domain of PTP1B is defined by the atomic coordinates set out in Table 1
or Table 2. The three-dimensional structure of the binding sites of the PTP1B
sulfenyl amide is shown schematically in Figures 1b and 1c.

The crystal structure shows electron density close to the side chain of the catalytic
cysteine characteristic of the presence of a covalent bond between the sulphur Sγ
atom of Cys215 and the backbone nitrogen atom of Ser216 (see Figure 1c). The
sulfenyl-amide bond has a bond length of 1.7 Å and results in a five-membered
puckered ring that has not been previously observed in proteins. In conjunction
with the formation of the sulfenyl-amide derivative the phosphate-binding cradle
adopts a novel conformation, distinct from the structure of the known inactive
C215S PTP1B mutant. The cradle has shifted into the phosphotyrosine binding
site and stabilises the sulfenyl-amide by a hydrophobic interaction with the side
chain of Ile219 (Figure 1b). In addition, the side chain of Gln262 moves out of the
active site and the pTyr loop adopts a unique conformation (Figure 1b). The
more exposed conformation of the pTyr loop results from the loss of the hydrogen
bond between the hydroxyl groups of Tyr46 and Ser216, which anchors the pTyr
loop in native PTP1B and is stabilised by a network of water molecules mediating
interactions between Asp48 and the rest of the protein.
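
As an illustration of how a bond length such as the one quoted above is obtained from atomic coordinates, the following sketch computes the distance between two atoms. The coordinates are hypothetical values chosen for the example and are not taken from Table 1 or Table 2; the tolerance used to flag a covalent contact is likewise an assumption.

```python
import math

# Hypothetical Cartesian coordinates in angstroms, for illustration only.
cys215_sg = (10.0, 5.0, 3.0)  # sulphur S-gamma atom of Cys215
ser216_n = (10.0, 6.2, 4.2)   # backbone nitrogen atom of Ser216


def bond_length(atom_a, atom_b):
    """Euclidean distance between two atoms given as (x, y, z) tuples."""
    return math.dist(atom_a, atom_b)


def is_sulfenyl_amide_contact(distance, expected=1.7, tolerance=0.15):
    """True if the S-N distance lies within `tolerance` of the expected
    bond length (both in angstroms; the tolerance is an assumed value)."""
    return abs(distance - expected) <= tolerance


s_n_distance = bond_length(cys215_sg, ser216_n)
```

A non-bonded sulphur and nitrogen would lie considerably further apart, so a distance close to 1.7 Å is consistent with a covalent S-N bond.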

Formation of the sulfenyl-amide arises from oxidation of the active site Cys215,
most likely via oxidation of Cys215 to sulfenic acid, followed by a nucleophilic
attack of the backbone nitrogen atom of Ser216 on the Sγ atom of Cys215. Indeed,
it has been postulated that the hydrogen bond interaction between the carbonyl
oxygen atom of Cys215 and the N1 atom of the invariant His214 side chain in
native PTP1B increases the partial charge on the backbone nitrogen atom of
Ser216, enhancing its reactivity and supporting a nucleophilic substitution
mechanism. In vivo the sulfenic acid can be formed by oxidation with hydrogen
peroxide (H2O2), but under our experimental conditions it is most likely triggered
by redox-cycling of 2-phenyl-isoxazolidine-3,5-dione (Figure 2). The leaving group
in the cyclisation reaction could be a water molecule in an SN2 substitution reaction
(direct mechanism, Figure 2). Alternatively, the sulfenic acid derivative might be
oxidised under the experimental soaking conditions, or by physiological oxidants
such as hydrogen peroxide or oxidised glutathione in vivo, to form a highly reactive
intermediate. This would result in faster formation of the sulfenyl-amide species
(oxidative mechanism, Figure 2).

The role of the sulfenyl-amide intermediate in oxidative regulation of PTP1B in
cells is likely to be the prevention of irreversible oxidation of Cys215 and thus
facilitation of its thiol-mediated reactivation (Figure 2). To demonstrate the
reversibility of the S-N bond, reducing conditions were employed in an attempt to
reduce the sulfenyl-amide derivative in the crystals. First, two PTP1B crystals were
soaked in a solution containing 2-phenyl-isoxazolidine-3,5-dione. X-ray data
collected from one of the crystals confirmed the formation of the sulfenyl-amide
bond and the concomitant conformational changes in the active site. The second
crystal was back-soaked in 20 mM reduced glutathione in an attempt to reduce the
Cys215 sulfenyl-amide derivative back to its native form. Indeed, X-ray data
collected from the back-soaked crystal showed the entire active site back in its
native conformation, thus structurally confirming reactivation of the sulfenyl-amide
PTP1B derivative by a biologically relevant reducing agent and strengthening the
hypothesis of a protective role in PTP1B redox-regulation.

The studies carried out provide a detailed structural understanding of the
intermediates involved in redox-regulation of PTP1B and reveal a novel oxidation
state of its catalytic cysteine. The formation of the sulfenyl-amide intermediate is
an elegant mechanism to protect Cys215 from further oxidation, and the
concomitant conformational changes of the phosphate-binding cradle and pTyr loop
may serve to signal the inactive state of the enzyme. The structures of the sulfenic
acid and sulfenyl-amide derivative indicate that reactivation of PTP1B appears to
be facilitated by the sulfenyl-amide form.

The sulfenyl-amide form of PTPs is an important regulatory intermediate of these
proteins. It is stable to oxidation to the sulfinic or sulfonic protein forms that are
irreversibly inhibited and therefore prevents permanent inactivation of the protein.
It can then subsequently be converted to the reduced, active form of the protein by
physiological reducing agents such as thiols. The sulfenyl amide is an
isothiazolidin-3-one ring system, which has not been previously observed in
proteins. It is an electrophilic species in the active site of an enzyme, and this is also
very unusual as most enzymes display nucleophiles in their catalytic machinery.

Therapeutic and Medical Uses of the Inhibitors of PTP Sulfenyl Amide

It is envisaged that compounds that stabilize the sulfenyl amide form or effect
reversible or irreversible covalent modification of the sulfenyl amide form will be
useful as therapeutic agents in the treatment of disease states or conditions mediated
by protein tyrosine phosphatases.

Accordingly, in one aspect, the invention provides the use of a compound for the
manufacture of a medicament for the treatment of a disease or condition mediated
by protein tyrosine phosphatase, wherein the compound is one that binds to protein
tyrosine phosphatase sulfenyl amide to prevent or inhibit conversion of the protein
tyrosine phosphatase sulfenyl amide to an active reduced form of the protein
tyrosine phosphatase.

In a further aspect, the invention provides a method of reducing the activity of a
protein tyrosine phosphatase (PTP), the PTP being one which is convertible
between an active form and an inactive or less active form, the inactive or less
active form being characterised by the presence of a sulfenyl amide moiety formed
at the active site of the PTP between the sulphur atom of a cysteine group and a
backbone nitrogen atom of a neighbouring amino acid, whereby the sulfenyl amide
moiety distorts and inactivates the active site of the PTP and wherein the sulfenyl
amide moiety is disruptible to restore the inactive or less active form of the PTP to
the active form thereof;

which method comprises inhibiting disruption of the sulfenyl amide moiety,
or modifying the sulfenyl amide moiety to prevent restoration of the inactive or less
active form of the PTP to the active form.

In another aspect, the invention provides a method of inhibiting or preventing the
reduction of sulfenyl amide PTP1B to PTP1B in a cellular environment by exposing
the PTP1B to a ligand capable of binding to the sulfenyl amide PTP1B in the region
of the sulfenyl amide moiety so as to prevent reduction of the sulfenyl amide
moiety by an intracellular reducing agent.

The invention also provides a method of inhibiting or preventing the reduction of
sulfenyl amide PTP1B to PTP1B in a cellular environment by exposing the PTP1B
to a ligand capable of binding to the sulfenyl amide PTP1B in the region of the
sulfenyl amide moiety, the ligand having a nucleophilic moiety capable of
modifying the sulfenyl amide moiety so as to prevent its reduction by an
intracellular reducing agent.

The Compounds of the Invention 

The invention provides novel compounds per se that inhibit protein tyrosine
phosphatases by interacting with the sulfenyl amide PTP to prevent or inhibit
conversion of the PTP sulfenyl amide to an active reduced form of the protein
tyrosine phosphatase.

In addition to compounds per se, the invention provides compounds of the aforesaid
type for use in therapy or for use in medicine, for example for use in the treatment
of diseases or conditions mediated by protein tyrosine phosphatase.



Conversion of the PTP sulfenyl amide to the corresponding active reduced form can
be inhibited in several ways by small molecule ligands.

Mode 1. Non-covalent binding inhibitors that stabilise the sulfenyl-amide protein
form. These inhibitors are designed to prevent physiological cell cycling of the
protein form into its active reduced form by preventing binding to the protein.

Mode 2. Reversible covalent binding inhibitors that modify the sulfenyl-amide form
of the protein. These inhibitors are designed to react with the active site sulfenyl
amide, and in so doing, prevent its reactivation by physiological cell cycling.

Mode 3. Irreversible covalent binding inhibitors that modify the sulfenyl-amide
form of the protein. These inhibitors are designed to react with the active site
sulfenyl amide, and in so doing, prevent its reactivation by physiological cell
cycling.

The concerted distortion of the phosphate binding cradle and phosphotyrosine
recognition loop upon sulfenyl-amide formation destroys the normal
phosphotyrosine binding site and creates a new groove (referred to herein as "the
first binding site") in which small molecules could bind. This groove is lined by
residues 41-47 of the phosphotyrosine recognition loop, residues 88-90, 115 to 120,
residues 179 to 184 of the WPD-loop, residues 215 to 219 of the phosphate-binding
cradle, and residues 262-266.
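
For modelling purposes the residue ranges lining the groove can be expanded into an explicit set of residue numbers, for example to select the corresponding atoms from the coordinates of Table 1 or Table 2. The following is a minimal sketch; the function and variable names are hypothetical.

```python
# Residue ranges (inclusive, PTP1B numbering) lining the first binding site,
# as listed in the text above.
FIRST_SITE_RANGES = [
    (41, 47),    # phosphotyrosine recognition loop
    (88, 90),
    (115, 120),
    (179, 184),  # WPD-loop
    (215, 219),  # phosphate-binding cradle
    (262, 266),
]


def expand_ranges(ranges):
    """Expand inclusive (start, end) pairs into a set of residue numbers."""
    return {number for start, end in ranges
            for number in range(start, end + 1)}


first_site_residues = expand_ranges(FIRST_SITE_RANGES)


def lines_first_site(residue_number):
    """True if the residue number falls within one of the listed ranges."""
    return residue_number in first_site_residues
```

The resulting set contains 32 residue numbers and can be used as a simple selection mask when inspecting the groove.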

Compounds having Mode 1 activity include compounds that can make polar 
interactions at the first binding site with one or more of: 



(1) Lys41
(2) Asn42
(3) Arg45
(4) Tyr46
(5) Arg47
(6) Asn90
(7) Gln115
(8) Lys116
(9) Ser118
(10) Lys120
(11) Trp179
(12) Ser216
(13) Arg221
(14) Gln262
(15) Thr263
(16) Asp265, and
(17) Gln266

The amino acid numbering convention used above refers to the numbering of
PTP1B.

Preferably, the compounds make polar interactions with two or more of the listed
moieties (1) to (17), more preferably three or more, for example four or more, and
more particularly five or more.

The compounds can make hydrophobic interactions with one or more of: 



(18) Leu88
(19) Pro89
(20) Leu119
(21) Phe182
(22) Gly183
(23) Val184
(24) Ala217
(25) Ile219
(26) the apolar part of Arg221, and
(27) the apolar part of Gln262.



Preferably, the compounds make hydrophobic interactions with two or more of the 
listed moieties, more preferably three or more, for example five or more. 

Additional hydrogen bonds and hydrophobic interactions may be formed between a
bound molecule and the protein backbone.

A second shallow depression in which a small molecule could bind (referred to
hereinafter as "the second binding site") is located on the other side of the distorted
protein tyrosine recognition loop and includes residues from the second phosphate
binding site in PTP1B. This potential binding area is roughly defined by residues of
the WPD-loop, the pTyr recognition loop and the loop containing residues 28-32.

Compounds having Mode 1 activity include compounds that can make polar 
interactions at the second binding site with one or more of: 



(44) Arg24
(14) Gln262
(45) Arg254
(46) Asn44
(5) Arg47
(4) Tyr46
(1) Lys41
(47) Lys36
(48) Asp29
(49) Cys32
(50) Ser50



The compounds can make hydrophobic interactions with one or more of:

(51) Leu250
(14) Gln262
(41) Met258
(35) Val49
(4) Tyr46
(39) Gly218
(52) Gly259
(53) Phe52
(42) Leu260
(54) Leu261
(55) Ala35 and
(56) the backbone of Asp48.



In the middle of this binding area a third potential binding site (hereinafter referred
to as "the third binding site") has been created as a result of the distortion of the
phosphate-binding cradle. This water-filled cavity is located directly under the
distorted phosphate-binding cradle and has a narrow entrance between residues
Val49, Gly218 and Gln262. The cavity walls are formed by Asp48, Val49, Leu83,
Gly218, Gly220, Ser222, Arg257, Gly259, Gln262 and the sulfenyl-amide.

Accordingly, compounds having Mode 1 activity include compounds that can make
polar interactions at the third binding site with one or more of:

(3) Arg45
(29) Asp48
(30) Ser222
(31) Arg257
(14) Gln262
(33) the protein backbone of one or more of (i) Thr84, (ii) Gly218, (iii) Gly220,
(iv) Gly223, (v) Met258 and (vi) Gly259; and
(34) the sulfenyl-amide residue.

Preferably the compounds can make polar interactions at two or more (more
preferably three or more, four or more, or five or more) of the residues (3), (29) to
(31), (14), (33) and (34).



Hydrophobic interactions with the compound can be made at the third binding site
by:

(35) Val49
(36) Leu83
(37) Gln85
(38) Gly86
(39) Gly218
(40) Gly220
(41) Met258
(42) Leu260 and
(43) the main chain of His214.

Preferred interactions between the compounds of the invention and the three
binding sites described above are as follows:

First binding site:
Polar: Trp79, Arg221, Lys46, Glu266, Arg45, Ser118
Hydrophobic: Ile219, Leu88, Ile120

Second binding site:
Polar: Arg254, Lys36, Asp29, Gln262
Hydrophobic: Met258, Val49

Third binding site (cavity):
Polar: Arg257, Asp48, Ser222
Hydrophobic: sulfenyl amide, Val49, Leu83

As described above, compounds useful in the invention include those that bind to
the sulfenyl amide PTP in the region of the sulfenyl amide moiety thereby to
prevent reduction or other reaction of the sulfenyl amide with an endogenous
intracellular molecule such as glutathione and conversion back to the active form of
the PTP. However, in an alternative embodiment of the invention, the compounds
can be ligands that possess a nucleophilic functional group that can react either
reversibly (Mode 2) or irreversibly (Mode 3) with the electrophilic sulfenyl amide
active site. Scheme 1 below illustrates how ligands inhibit the action of the protein in
the cell by preventing it from being converted back to an active form.

E-SH (active) --[Oxidation]--> E-S-OH --[Cyclisation]--> Sulfenyl Amide Enzyme Intermediate (inactive)

Sulfenyl Amide Enzyme Intermediate --[RSH]--> E-S-SR --[Reduction: RSH, releasing RS-SR]--> E-SH (active)

Sulfenyl Amide Enzyme Intermediate + Nuc-L --> Covalently modified protein (inactive)

SCHEME 1



The compounds of the invention can thus take the form of nucleophilic ligands,
having a nucleophilic group that will react with the sulfenyl amide moiety, and a
binding region for binding to the sulfenyl amide PTP in the region of the sulfenyl
amide moiety. The binding region can be one that exhibits one or more of the polar
and non-polar interactions 1 to 56 set out above in relation to Mode 1 compounds.

The nucleophilic group will typically contain a heteroatom (e.g. selected from
nitrogen, sulphur, oxygen and phosphorus) that is either neutral or negatively
charged, and which may be located adjacent a carbon atom or another heteroatom,
which is capable of reacting with the sulfenyl amide species. Nitrogen, oxygen and
sulfur nucleophiles are preferred.

Thus, the sulfenyl amide moiety can be modified by reaction with a nucleophilic
ligand to prevent it from reverting to the active form of the enzyme.

The nucleophilic group is selected from the group consisting of a thiol, disulfane,
primary thioamide, secondary thioamide, primary thiourea, secondary thiourea,
primary amine, secondary amine, primary hydrazine, secondary hydrazine, primary
hydrazide, secondary hydrazide, primary hydrazone, secondary hydrazone, primary
amide, secondary amide, primary urea, secondary urea, primary sulfonamide,
secondary sulfonamide, 5-membered ring heterocycle containing NH, alcohol,
hydroxylamine, oxime, hydroxamic acid, carboxylic acid (preferably other than an
oxalamic acid), sulfoxide, sulfate and a nitrone.

Particular examples of nucleophiles are set out in Table 3 below.

Table of Nucleophiles

Nuc-L

Nuc = nucleophile
L = Ligand

Table 3

Type of nucleophile   Structure                  Name
Sulphur               L-SH                       Thiol
                      L-S-SH                     Disulfane
                      L-C(=S)-NH2                Primary Thioamide
                      L-C(=S)-NH-L'              Secondary Thioamide
                      L-NH-C(=S)-NH2             Primary thiourea
                      L-NH-C(=S)-NH-L'           Secondary thiourea
Nitrogen              L-NH2                      Primary amine
                      L-NH-L'                    Secondary amine
                      L-NH-NH2                   Primary Hydrazine
                      L-NH-NH-L'                 Secondary Hydrazine
                      L-C(=O)-NH-NH2             Primary Hydrazide
                      L-C(=O)-NH-NH-L'           Secondary Hydrazide
                      L=N-NH2                    Primary Hydrazone
                      L=N-NH-L'                  Secondary Hydrazone
                      L-C(=O)-NH2                Primary amide
                      L-C(=O)-NH-L'              Secondary amide
                      L-NH-C(=O)-NH2             Primary urea
                      L-NH-C(=O)-NH-L'           Secondary urea
                      L-S(=O)2-NH2               Primary Sulfonamide
                      L-S(=O)2-NH-L'             Secondary Sulfonamide
                      [ring drawing]             5-membered ring heterocycle
                                                 containing NH
Oxygen                L-OH                       Alcohol
                      L-NH-OH                    Hydroxylamine
                      L=N-OH                     Oxime
                      L-C(=O)-NH-OH              Hydroxamic acid
                      L-C(=O)-OH                 Carboxylic acid (preferably
                                                 not oxalamic acids)
                      L-S(=O)-L'                 Sulfoxide
                      L-S(=O)2-O(-)              Sulfate
                      L=N(+)-O(-)                Nitrone



In the table, the symbols L and L' represent the residue of the ligand, other than
the nucleophilic group. The residue may of course contain one or more further
nucleophilic groups of the type shown.

Compounds of the invention that are nucleophilic ligands will form new covalently
bound protein-ligand species. In some cases (Mode 2), the protein-ligand species is
capable of undergoing the reverse reaction to reform the sulfenyl amide. In other
cases (Mode 3), the nucleophiles will form a covalently bound protein-ligand
complex in which the reverse reaction does not occur in the environment of the
active site, or occurs very slowly, so that the complex is formed irreversibly.
Additionally, certain covalent protein-ligand complexes, formed by reaction of
nucleophilic ligands with the sulfenyl amide protein, may undergo additional
reactions that prevent the reverse reaction from occurring, resulting in irreversible
inhibition.

For example, where the nucleophilic heteroatom is an oxygen atom, the resulting
covalent protein-ligand complex will contain a sulfur-oxygen bond and is
therefore oxidised. Subsequent further oxidation under cellular conditions could
lead to oxidation of the protein to sulfinic or sulfonic acid oxidation states,
irreversibly modifying the protein.

The compounds of the invention are typically synthetic compounds that are not
normally encountered in a cellular environment, although naturally occurring
compounds derived from plant sources, marine sources or other non-mammalian
sources may be used where appropriate.

The compounds of the invention are typically organic compounds and can be
non-peptides, peptides or modified peptides. In one embodiment, the compounds are
not peptides.

The compounds of the invention may comprise a scaffold formed from one or more
optionally substituted carbocyclic or heterocyclic ring systems, the ring systems
and/or the substituents having one or more polar or non-polar moieties for
interacting with one or more, preferably a plurality of the binding sites 1 to 43 listed
above.

The carbocyclic and heterocyclic ring systems can be aromatic or non-aromatic ring
systems. When the carbocyclic or heterocyclic groups are aryl or heteroaryl groups,
they can have, for example, from 5 to 12 ring members, more usually from 5 to 10
ring members. The term "aryl" as used herein refers to a carbocyclic group having
aromatic character and the term "heteroaryl" is used herein to denote a heterocyclic
group having aromatic character. The terms "aryl" and "heteroaryl" embrace
polycyclic (e.g. bicyclic) ring systems wherein one or more rings are non-aromatic,
provided that at least one ring is aromatic. The aryl or heteroaryl groups can be
monocyclic or bicyclic groups and can be unsubstituted or substituted with one or
more substituents. Examples of heteroaryl groups are monocyclic and bicyclic
groups containing from five to twelve ring members, and more usually from five to
ten ring members. The heteroaryl group can be, for example, a five membered or
six membered monocyclic ring or a bicyclic structure formed from fused five and
six membered rings or two fused six membered rings. Each ring may contain up to
about four heteroatoms typically selected from nitrogen, sulphur and oxygen.
Typically the heteroaryl ring will contain up to 3 heteroatoms, more usually up to 2,
for example a single heteroatom. In one embodiment, the heteroaryl ring contains
at least one ring nitrogen atom. The nitrogen atoms in the heteroaryl rings can be
basic, as in the case of imidazole or pyridine, or essentially non-basic as in the case
of an indole or pyrrole nitrogen. In general the number of basic nitrogen atoms
present in the heteroaryl group, including any amino group substituents of the ring,
will be less than five.

Examples of heteroaryl groups include but are not limited to pyridyl, pyrrolyl,
furanyl, thiophenyl, imidazolyl, oxazolyl, oxadiazolyl, oxatriazolyl, isoxazolyl,
thiazolyl, isothiazolyl, pyrazolyl, pyrazinyl, pyrimidinyl, triazinyl, triazolyl,
tetrazolyl, quinolinyl, isoquinolinyl, benzfuranyl, benzthiophenyl, chromanyl,
thiochromanyl, benzimidazolyl, benzoxazolyl, benzisoxazole, benzthiazolyl and
benzisothiazole, isobenzofuranyl, isoindolyl, indolizinyl, indolinyl, isoindolinyl,
purinyl (e.g., adenine, guanine), indazolyl, benzodioxolyl, chromenyl,
isochromenyl, chroman, isochromanyl, benzodioxanyl, quinolizinyl, benzoxazinyl,
benzodiazinyl, pyridopyridinyl, quinoxalinyl, quinazolinyl, cinnolinyl, phthalazinyl,
naphthyridinyl and pteridinyl.

Examples of carbocyclic aryl groups include phenyl, naphthyl, indenyl, and
tetrahydronaphthyl.

Examples of non-aromatic heterocyclic groups are groups having from 3 to 12 ring
members, more usually 5 to 10 ring members. Such groups can be monocyclic or
bicyclic, for example, and typically have from 1 to 5 heteroatom ring members
(more usually 1, 2, 3 or 4 heteroatom ring members), usually selected from
nitrogen, oxygen and sulphur. The heterocyclic groups can contain, for example,
cyclic ether moieties (e.g. as in tetrahydrofuran and dioxane), cyclic thioether
moieties (e.g. as in tetrahydrothiophene), cyclic amine moieties (e.g. as in
pyrrolidine), cyclic sulphones (e.g. as in sulfolane and sulfolene), cyclic
sulphoxides, cyclic sulphonamides and combinations thereof.

Particular examples include morpholine, piperidine, pyrrolidine, pyrrolidone,
tetrahydrofuran, tetrahydrothiophene, dioxan, tetrahydropyran, imidazoline,
imidazolidinone, oxazoline, thiazoline, piperazine, and N-alkyl piperazines such as
N-methyl piperazine. In general, preferred non-aromatic heterocyclic groups
include tetrahydrofuran, morpholine, piperazine, piperidine, pyrrolidine and
pyrrolidone.

The carbocyclic and heterocyclic groups can each be unsubstituted or substituted by
one or more substituent groups selected from halogen, hydroxy, trifluoromethyl,
cyano, nitro, carboxy, amino, carbocyclic and heterocyclic groups having from 3 to
12 ring members; a group R^a-R^b wherein R^a is a bond, O, CO, X^1C(X^2), C(X^2)X^1,
X^1C(X^2)X^1, S, SO, SO2, NR^c, SO2NR^c or NR^cSO2; and R^b is selected from
hydrogen, carbocyclic and heterocyclic groups having from 3 to 7 ring members,
and a C1-8 hydrocarbyl group optionally substituted by one or more substituents
selected from hydroxy, oxo, halogen, cyano, nitro, amino, mono- or di-C1-4
hydrocarbylamino, carbocyclic and heterocyclic groups having from 3 to 12 ring
members and wherein one or more carbon atoms of the C1-8 hydrocarbyl group may
optionally be replaced by O, S, SO, SO2, NR^c, X^1C(X^2), C(X^2)X^1 or X^1C(X^2)X^1;

R^c and R^d are the same or different and each is hydrogen or C1-4
hydrocarbyl;

X^1 is O, S or NR^c and X^2 is =O, =S or =NR^c.

Where the substituent group comprises or includes a carbocyclic or heterocyclic
group, the said carbocyclic or heterocyclic group may be unsubstituted or may itself
be substituted with one or more further substituent groups. In one sub-group of
compounds of the formula (I), such further substituent groups may include
carbocyclic or heterocyclic groups, which are typically not themselves further
substituted. In another sub-group of compounds of the formula (I), the said further
substituents do not include carbocyclic or heterocyclic groups but are otherwise
selected from the groups listed above in the definition of the substituents.

Examples of halogen substituents include fluorine, chlorine, bromine and iodine.
Fluorine and chlorine are particularly preferred.

In the definition of the compounds above and as used hereinafter, the term
"hydrocarbyl" is a generic term encompassing aliphatic, alicyclic and aromatic
groups having an all-carbon backbone, except where otherwise stated. Examples of
such groups include alkyl, cycloalkyl, cycloalkenyl, carbocyclic aryl, alkenyl,
alkynyl, cycloalkylalkyl, cycloalkenylalkyl, and carbocyclic aralkyl, aralkenyl and
aralkynyl groups. Such groups can be unsubstituted or substituted by one or more
substituents as defined herein. The examples and preferences expressed below
apply to each of the hydrocarbyl substituent groups or hydrocarbyl-containing
substituent groups referred to in the various definitions of substituents for
compounds of the invention unless the context indicates otherwise.

The term "alkyl" covers both straight chain and branched chain alkyl groups.
Examples of alkyl groups include methyl, ethyl, propyl, isopropyl, n-butyl, isobutyl,
tert-butyl, n-pentyl, 2-pentyl, 3-pentyl, 2-methyl butyl, 3-methyl butyl, and n-hexyl
and its isomers.

Examples of cycloalkyl groups are those derived from cyclopropane, cyclobutane,
cyclopentane, cyclohexane and cycloheptane.

Examples of alkenyl groups include, but are not limited to, ethenyl (vinyl),
1-propenyl, 2-propenyl (allyl), isopropenyl, butenyl, buta-1,4-dienyl, pentenyl, and
hexenyl.

Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl,
cyclobutenyl, cyclopentenyl, cyclopentadienyl and cyclohexenyl.

Examples of alkynyl groups include, but are not limited to, ethynyl and 2-propynyl
(propargyl) groups.

Examples of carbocyclic aryl groups include substituted and unsubstituted phenyl.

Examples of cycloalkylalkyl, cycloalkenylalkyl, carbocyclic aralkyl, aralkenyl and
aralkynyl groups include phenethyl, benzyl, styryl, phenylethynyl,
cyclohexylmethyl, cyclopentylmethyl, cyclobutylmethyl, cyclopropylmethyl and
cyclopentenylmethyl groups.

The definition "R^a-R^b" as used herein, includes inter alia compounds wherein R^a is
selected from a bond, O, CO, OC(O), SC(O), NR^cC(O), OC(S), SC(S), NR^cC(S),
OC(NR^c), SC(NR^c), NR^cC(NR^c), C(O)O, C(O)S, C(O)NR^c, C(S)O, C(S)S,
C(S)NR^c, C(NR^c)O, C(NR^c)S, C(NR^c)NR^c, OC(O)O, SC(O)O, NR^cC(O)O, OC(S)O,
SC(S)O, NR^cC(S)O, OC(NR^c)O, SC(NR^c)O, NR^cC(NR^c)O, OC(O)S, SC(O)S,
NR^cC(O)S, OC(S)S, SC(S)S, NR^cC(S)S, OC(NR^c)S, SC(NR^c)S, NR^cC(NR^c)S,
OC(O)NR^c, SC(O)NR^c, NR^cC(O)NR^c, OC(S)NR^c, SC(S)NR^c, NR^cC(S)NR^c,
OC(NR^c)NR^c, SC(NR^c)NR^c, NR^cC(NR^c)NR^c, S, SO, SO2, NR^c, SO2NR^c and
NR^cSO2 wherein R^c is as hereinbefore defined.
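
The carbon-centred linkers in the above list are the combinations obtained by letting the first variable take the values O, S and NR with its superscript (written "NRc" in the sketch) and the second variable take the values =O, =S and =NR in the three generic patterns given earlier. The enumeration can be sketched as follows; the plain-text tokens and the function name are notational assumptions made for this illustration.

```python
from itertools import product

# X1 is O, S or NRc; X2 is =O, =S or =NRc. Inside C(...) the "=" is omitted,
# following the way the groups are written out in the text.
X1_VALUES = ["O", "S", "NRc"]
X2_VALUES = ["O", "S", "NRc"]


def enumerate_linkers():
    """List every group of the patterns X1C(X2), C(X2)X1 and X1C(X2)X1."""
    # Pattern X1C(X2): X2 varies slowest, X1 fastest.
    left = [f"{x1}C({x2})" for x2, x1 in product(X2_VALUES, X1_VALUES)]
    # Pattern C(X2)X1: X2 varies slowest, X1 fastest.
    right = [f"C({x2}){x1}" for x2, x1 in product(X2_VALUES, X1_VALUES)]
    # Pattern X1C(X2)X1: trailing X1 slowest, then X2, leading X1 fastest.
    both = [f"{lead}C({x2}){trail}"
            for trail, x2, lead in product(X1_VALUES, X2_VALUES, X1_VALUES)]
    return left + right + both


linkers = enumerate_linkers()
```

This yields 9 + 9 + 27 = 45 groups, to which the remaining non-carbon linkers (a bond, O, CO, S, SO, SO2 and the sulphonamide and amine linkers) are added to give the full list.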

The moiety R^b can be hydrogen or it can be a group selected from carbocyclic and
heterocyclic groups having from 3 to 12 ring members (typically 3 to 10 and more
usually from 5 to 10), and a C1-8 hydrocarbyl group optionally substituted as
hereinbefore defined.

Examples of hydrocarbyl, carbocyclic and heterocyclic groups are as set out above.

When present, the hydrocarbyl group can be substituted by one or more substituents
selected from hydroxy, oxo, halogen, cyano, nitro, amino, mono- or di-C1-4
hydrocarbylamino, and monocyclic carbocyclic and heterocyclic groups having
from 3 to 12 (typically 3 to 10 and more usually 5 to 10) ring members. Preferred
substituents include halogen such as fluorine. Thus, for example, the substituent
can be a partially fluorinated or perfluorinated group such as trifluoromethyl.

One or more carbon atoms of the C1-8 hydrocarbyl group may optionally be
replaced by O, S, SO, SO2, NR^c, X^1C(X^2), C(X^2)X^1 or X^1C(X^2)X^1 wherein X^1 and
X^2 are as hereinbefore defined. For example, 1, 2, 3 or 4 carbon atoms of the
hydrocarbyl group may be replaced by one of the atoms or groups listed, and the
replacing atoms or groups may be the same or different. Examples of groups in
which a carbon atom of the hydrocarbyl group has been replaced by a replacement
atom or group as defined above include ethers and thioethers (C replaced by O or
S), amides, esters, thioamides and thioesters (C replaced by X^1C(X^2) or C(X^2)X^1),
sulphones and sulphoxides (C replaced by SO or SO2) and amines (C replaced by
NR^c).

The combination of ring system and substituents is chosen so as to give a desired
level of interaction with residues (1) to (56) in the three binding sites defined above.
The extent of the interaction between the compound and the binding sites of the
PTP sulfenyl amide can be gauged using the computer based modelling methods
discussed above based on the atomic coordinates set out in Table 1 or Table 2.

In general, the number of interactions between the compound and the PTP sulfenyl
amide may be chosen so as to optimise the binding of the compound to the PTP
sulfenyl amide. For Mode 1 inhibitor compounds, in one embodiment, it is
preferred to maximise the number of interactions between the compound and the
first and/or second and/or third binding sites so as to provide enhanced binding to
the PTP sulfenyl amide.

For example, where the compound is designed to bind to the first binding site, it is
preferred that the compound makes polar interactions with at least seven, more
usually at least ten, and preferably at least twelve of the residues (1) to (17) and
hydrophobic interactions with at least two and more preferably at least four of the
residues (18) to (27).

Where the compound is designed to bind to the second binding site, it is preferred
that the compound forms polar interactions with at least two, more usually three,
and preferably four of the residues (44), (14), (45), (46), (5), (4), (1), (47), (48),
(49) and (50), and hydrophobic interactions with at least one or two of the residues
(51), (14), (41), (35), (4), (39), (52), (53), (42), (54), (55) and (56).

Where the compound is designed to bind to the third binding site, it is preferred that
the compound forms polar interactions with at least two, more usually three, and
preferably at least four of the residues (3), (29), (30), (31), (14), (33) and (34), and
preferably at least two and more usually at least three hydrophobic interactions with
the residues (35), (36), (37), (38), (39), (40), (41), (42) and (43).
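
The minimum preferred interaction counts stated above for the three binding sites
can be collected into a simple check. The following is only an illustrative sketch:
the dictionary keys and the function name are hypothetical, and the interaction
counts are assumed to come from a separate modelling step that is not shown.

```python
# Lowest preferred interaction counts per binding site, as stated in the text.
# The site labels and function name are hypothetical, chosen for this sketch.
MIN_INTERACTIONS = {
    "first": {"polar": 7, "hydrophobic": 2},
    "second": {"polar": 2, "hydrophobic": 1},
    "third": {"polar": 2, "hydrophobic": 2},
}


def meets_preferred_minimum(site, polar, hydrophobic):
    """Return True if the counts reach the lowest preferred level for the site."""
    limits = MIN_INTERACTIONS[site]
    return polar >= limits["polar"] and hydrophobic >= limits["hydrophobic"]


if __name__ == "__main__":
    # Hypothetical counts for a modelled compound at the first binding site.
    print(meets_preferred_minimum("first", polar=10, hydrophobic=4))
```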

In one embodiment of the invention, the compound forms interactions at two or
more of the first, second and third binding sites, for example, (i) the first and second
binding sites, or (ii) the first and third, or (iii) the second and third binding sites.

In another embodiment, the compound forms interactions with only the first binding
site.

In a further embodiment, the compound forms interactions with only the second 
binding site. 

In a further embodiment, the compound forms interactions with only the third
binding site.

In a still further embodiment, the compound forms interactions with all three
binding sites.

A compound is considered to have formed an interaction with a given residue at a
binding site if the compound, or a portion thereof, is in such proximity to the
residue, or a portion thereof, that the juxtaposition is energetically favored by
electrostatic or van der Waals interactions. The distance will depend on the type of
interaction made: hydrogen bond, salt bridge or stacking interaction.

The term hydrogen bond refers to a favorable interaction that occurs whenever a
suitable donor atom, Q^1, bearing a proton, H, and a suitable acceptor atom, Q^2,
have a separation of < 3.5 Angstrom and where the angle Q^1-H-Q^2 is greater than
90 degrees. Sometimes, a single proton on a donor atom may form hydrogen bonds
with a plurality of suitable acceptor atoms, Q^2. For example, the proton on an
-NH- group may form a separate hydrogen bond with each of the two oxygen atoms
in a carboxylate anion. Suitable donor and acceptor atoms are well understood in
medicinal chemistry (G.C. Pimentel and A.L. McClellan, The Hydrogen Bond,
Freeman, San Francisco, 1960; R. Taylor and O. Kennard, Hydrogen Bond Geometry
in Organic Crystals, Accounts of Chemical Research, 17, pp. 320-326 (1984)).
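
The geometric hydrogen bond criterion given above (donor-acceptor separation of
less than 3.5 Angstrom and an angle at the proton greater than 90 degrees) can be
checked from atomic coordinates. The following is a minimal sketch only; the
function name and the example coordinates are hypothetical and are not taken from
Table 1 or Table 2.

```python
import math


def is_hydrogen_bond(donor, hydrogen, acceptor, max_dist=3.5, min_angle=90.0):
    """Apply the distance and angle test to (x, y, z) coordinates in Angstrom."""
    # Separation between the donor atom and the acceptor atom.
    separation = math.dist(donor, acceptor)
    # Vectors pointing from the proton to the donor and to the acceptor.
    to_donor = [d - h for d, h in zip(donor, hydrogen)]
    to_acceptor = [a - h for a, h in zip(acceptor, hydrogen)]
    norm = math.hypot(*to_donor) * math.hypot(*to_acceptor)
    cos_angle = sum(p * q for p, q in zip(to_donor, to_acceptor)) / norm
    # Clamp against rounding error before taking the arc cosine.
    cos_angle = max(-1.0, min(1.0, cos_angle))
    angle = math.degrees(math.acos(cos_angle))
    return separation < max_dist and angle > min_angle


if __name__ == "__main__":
    # Hypothetical linear donor-H...acceptor arrangement.
    print(is_hydrogen_bond((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.9, 0.0, 0.0)))
```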

The term "hydrogen bonding moiety" refers to a chemical structure containing one 
or more suitable hydrogen bond donor moieties or hydrogen bond acceptor 
moieties. 

The term "hydrogen bonding donor moiety" refers to a chemical structure
containing a suitable hydrogen bond donor atom bearing one or more protons.
Examples of donor atoms having one proton are -OH, -SH and -NH-.

Examples of donor atoms having more than one proton are -NH2, [-NH3]+ and
[-NH2-]+.

The term hydrogen bonding acceptor moiety refers to a chemical structure
containing a suitable hydrogen bond acceptor atom. Examples of acceptor atoms
include fluorine, oxygen, sulfur and nitrogen.

The term stacking interaction refers to the favorable attractive interactions between
two aromatic ring systems, wherein the two rings are juxtaposed such that they are
oriented either face-to-face, perpendicular or at an intermediate angle to each other.

Face-to-face stacking distances are usually between 3.5 and 4.5 Angstrom. Face-edge
stacking distances are usually between 3.5 and 4 Angstrom. Most aromatic protein
interactions involve separation distances of 3.6 to 3.8 Angstrom. Fully stacked
interactions are not usually observed. Most common are staggered stacked structures
with tilted rings. Perpendicular stacking may be face-edge or cogwheel. Both are
common. (Protein-protein recognition via side-chain interactions; Thornton et al.;
Biochemical Society Transactions; 927-930 (1988)).

The term salt bridge refers to the non-covalent attractive interaction between a
positively charged moiety (P) and a negatively charged moiety (N) when the
distance between the centers of mass of P and N is between 2 and 6 Angstroms. In
calculating the center of mass, atoms which may contain a formal charge and atoms
immediately adjacent to these are included. For example, a salt bridge may be
formed between the positively charged guanidinium side chain of an arginine
residue and the negatively charged carboxylate side chain of a glutamate residue.
Salt bridges are well known in medicinal chemistry (L. Stryer, Biochemistry,
Freeman, San Francisco, (1975); K.A. Dill, Dominant Forces in Protein Folding,
Biochemistry, 29, No. 31, pp. 7133-7155, (1990)).

The term center of mass refers to a point in three-dimensional space that represents 
a weighted average position of the masses that make up an object. 
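
The center of mass definition and the 2 to 6 Angstrom salt bridge criterion can be
illustrated as follows. This is a sketch under stated assumptions: the function
names and the example atoms are hypothetical, the atom selection (charged atoms
and their immediate neighbours) is assumed to have been done beforehand, and the
limits are treated as inclusive.

```python
import math


def center_of_mass(atoms):
    """atoms: list of (mass, (x, y, z)) pairs; returns the mass-weighted mean position."""
    total_mass = sum(mass for mass, _ in atoms)
    return tuple(
        sum(mass * xyz[axis] for mass, xyz in atoms) / total_mass
        for axis in range(3)
    )


def is_salt_bridge(positive_atoms, negative_atoms, low=2.0, high=6.0):
    """True if the centers of mass of moieties P and N are 2 to 6 Angstrom apart."""
    distance = math.dist(center_of_mass(positive_atoms), center_of_mass(negative_atoms))
    return low <= distance <= high


if __name__ == "__main__":
    # Hypothetical atoms: (mass, coordinates in Angstrom).
    positive = [(14.0, (0.0, 0.0, 0.0)), (12.0, (1.3, 0.0, 0.0))]
    negative = [(16.0, (4.0, 0.0, 0.0)), (12.0, (5.2, 0.0, 0.0))]
    print(is_salt_bridge(positive, negative))
```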

In order to form a desired number of binding interactions with the first binding site,
the compound preferably has a binding domain no longer than about 35 Angstrom
(the length of the first binding groove). Typically compounds will be, for
example, 5-30, 5-25, 5-15 or 10-15 Angstrom in length.

In order to form a desired number of binding interactions with the second binding
site, the compound preferably has a binding domain that can fit into an area of
about 30 by 30 Angstrom (the area of the second binding site). In order to optimise
interactions at this binding site, compounds will typically be, for example, 5-30,
5-25, 5-15 or 10-15 Angstrom in length.

The third binding site, the cavity underneath the cradle, is about 9 by 15 Angstrom
so only small binding domains or small molecules will fit in this cavity, for
example those that are 5-15 or 10-15 Angstrom in length.

A compound of the invention can possess only a single binding domain or can have
binding domains enabling it to bind to two or three binding sites.

Compounds exhibiting Type 2 or Type 3 activity typically have a binding domain
enabling them to bind to the first and/or second and/or third binding sites so as to
bring the nucleophilic group into reactive proximity of the sulfenyl amide group.
The nucleophilic groups can form part of the scaffold described above or can take
the form of substituents attached to the scaffold.

Screening of compounds 

In another aspect, in place of in silico methods, high or low throughput screening of
compounds to select compounds with binding activity may be undertaken, and
those compounds which show binding activity may be selected as possible
candidate modulators, and further crystallized with PTP1B sulfenyl amide (e.g. by
co-crystallization or by soaking) for X-ray analysis. The resulting X-ray structure
may be compared with that of Table 1 or Table 2 for a variety of purposes. The
screen may utilise any of the assays detailed below.

Assays for Screening for Active Compounds

Compounds may be identified in high-throughput or low-throughput screening as
outlined above, utilizing the assays detailed below. Compounds screened may
include those available from commercial sources, compounds generated
by standard synthetic chemistry methods, or those that are part of a corporate
compound collection.

Alternatively, once a candidate inhibitor compound has been identified, for example
by computer based rational drug design techniques as described above, the
compound is synthesized and tested. Whether or not the compounds are
inhibitors of the PTP sulfenyl amide can be determined by one of a number of
assays. Consequently, all the methods of compound design and identification
above (e.g. in silico analysis, ligand-sulfenyl amide PTP structure determination
etc.) preferably further comprise the steps of:

obtaining or synthesising the candidate modulator; and
contacting the candidate modulator with PTP1B sulfenyl amide to determine
the ability of the candidate modulator to interact with PTP1B sulfenyl amide.

For example, in one assay, the oxidized form (sulfenyl amide form) of a
recombinant or extracted protein tyrosine phosphatase is incubated with a candidate
binding compound and a determination is made as to whether the compound is able
to interact with the oxidized (sulfenyl amide) form of the protein tyrosine
phosphatase.

Such assays require the formation of the oxidized form of a protein tyrosine
phosphatase. The oxidized form can be produced by incubating the protein tyrosine
phosphatase in the presence of oxidizing agents such as a reactive oxygen species in
a cellular environment, organic peracids (e.g. MCPBA), peroxides (e.g. hydrogen
peroxide) or compound(s) as described in the examples below.

The assay can be a binding assay. Such a binding assay can be competitive or
non-competitive and can accommodate the screening of a large number of compounds
to determine if the compounds are capable of binding to the oxidized protein tyrosine
phosphatase. Subsequently other assays can be carried out with compounds found
to bind to determine the mode of binding of these compounds.

Alternatively, the assay can be a functional assay that identifies compounds that
trap the oxidized form of the protein tyrosine phosphatase and so change its ability
to regain its functional activity on reduction. Such an assay can involve incubating
potential trapping compounds with the oxidized form of the protein tyrosine
phosphatase and determining if protein tyrosine phosphatase activity can be
regained upon reduction.

In a further alternative, the assay can be a cell-based assay for identifying compounds
which modulate the cell-based activity of a protein tyrosine phosphatase, through
binding to the oxidized form of protein tyrosine phosphatases in cells.

Particularly preferred types of assays include binding assays, functional assays and
cell-based assays, which may be performed as follows:

Binding assays

Purified, oxidized protein tyrosine phosphatases can be used for binding studies.
Oxidized protein tyrosine phosphatase can be used in conventional filter-binding
assays or in a high throughput scintillation proximity-type binding assay to detect
binding of a radio-labelled ligand and its displacement by compounds which
compete for the binding site. Radioactivity can be measured with a Packard
Topcount or similar instrumentation capable of making rapid measurements for 96-,
384- or 1536-well microtitre plate formats.

Binding to oxidized protein tyrosine phosphatases could also be measured using a
fluorescently labelled ligand, which could be displaced by compounds competing
for the binding site. Binding could be detected by fluorescence polarisation methods
using an instrument such as the Packard Fusion reader to monitor fluorescence in
96-, 384- or 1536-well microtitre plate formats.

Another method for studying the binding of compounds to the oxidized protein
tyrosine phosphatase makes use of a surface plasmon resonance effect, measured by
a Biacore instrument. Oxidized protein tyrosine phosphatase could be attached to
the biosensor chip of a Biacore and binding of test compounds could be monitored.
Examples of the use of the surface plasmon resonance effect may be found in
Parsons et al. (1995) Nucleic Acids Res. 23, 211-216 and Parsons et al. (1997) Anal.
Biochem. 254(1), 82-87.

Binding of test compounds to an oxidized protein tyrosine phosphatase could also
be monitored using NMR techniques. The difference between NMR spectra of a
test compound with and without the oxidized protein tyrosine phosphatase could be
analysed to determine if the compound bound the protein tyrosine phosphatase.
Competition between test compounds and a known ligand for a binding site on the
protein tyrosine phosphatase could also be monitored in this way.

Preferred compounds are those that have a Kd value of less than 1 mM, more
preferably less than 1 µM and most preferably less than 100 nM. The term "Kd" is
used herein in its normal sense to mean the dissociation constant, which describes
the ratio of the concentrations at equilibrium between the free individual
components and the complex formed. For a complex between two components A
and B, Kd = [A][B]/[AB].
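
As a worked illustration of the Kd definition, rearranging Kd = [A][B]/[AB] gives
the fraction of A present as complex, [AB]/([A] + [AB]) = [B]/(Kd + [B]), where
[B] is the free concentration of B at equilibrium. The sketch below uses
hypothetical function names and example concentrations.

```python
def dissociation_constant(free_a, free_b, complex_ab):
    """Kd = [A][B]/[AB], with all concentrations in the same units (e.g. molar)."""
    return free_a * free_b / complex_ab


def fraction_bound(free_b, kd):
    """Fraction of A present as the complex AB at free concentration [B]."""
    return free_b / (kd + free_b)


if __name__ == "__main__":
    # Hypothetical case: Kd = 100 nM, free compound at 1 µM (values in molar).
    print(fraction_bound(1e-6, 100e-9))
```

When the free concentration of B equals Kd, half of A is bound.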

Functional assays 

The oxidized form of a protein tyrosine phosphatase is inactive, but can be
reactivated by reduction. An example of a functional assay to monitor compounds
that trap the oxidized form of the protein tyrosine phosphatase could involve
measuring the time taken for an oxidized form of a protein tyrosine phosphatase to
regain activity in the presence of the candidate inhibitor under reducing conditions.

Candidate trapping compounds could then be screened against the oxidized form of
the enzyme, by incubating candidate inhibitors and enzyme for a period of time and
then adding a reducing agent such as DTT or glutathione. The activity of the
enzyme could then be monitored at time intervals after this addition of reducing
agent. Enzyme which has been trapped in the oxidized form by candidate inhibitors
should take longer to regain activity than enzyme that has not been trapped. Time
taken to regain activity could be measured against controls containing no test
compound and so correlated to potency of the inhibitor.
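
One possible way of comparing reactivation times against a no-compound control is
sketched below. It is an assumption-laden illustration only: the function names,
the choice of 50% of the maximum control activity as the recovery threshold, and
the example readings are all hypothetical and are not specified in the text.

```python
def time_to_regain(times, activities, threshold):
    """Return the first time point at which activity reaches the threshold, or None."""
    for t, activity in zip(times, activities):
        if activity >= threshold:
            return t
    return None


def reactivation_delay(times, control, treated, fraction=0.5):
    """Extra time the treated enzyme needs to reach a fraction of control activity."""
    target = fraction * max(control)
    t_control = time_to_regain(times, control, target)
    t_treated = time_to_regain(times, treated, target)
    if t_control is None or t_treated is None:
        return None  # activity was not regained within the observation window
    return t_treated - t_control


if __name__ == "__main__":
    # Hypothetical readings (minutes after adding reducing agent, arbitrary units).
    minutes = [0, 5, 10, 15, 20]
    control = [0, 50, 90, 100, 100]
    treated = [0, 10, 30, 60, 95]
    print(reactivation_delay(minutes, control, treated))
```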

Assays for monitoring the activity of protein tyrosine phosphatases have been
described in the literature and can use known substrates such as p-nitrophenyl
phosphate or phosphorylated peptides (Hoppe et al., 1994, Eur. J. Biochem., 223,
1069-1077; Bleasdale et al., 2001, Biochemistry, 40, 5642-5654; Wang et al., 1999,
Biochim Biophys Acta, 1431, 14-23). Assays can be performed in 96- or 384-well
microtitre format using a Molecular Devices Spectramax plate reader, allowing
screening of a large number of compounds. The dephosphorylation of
p-nitrophenyl phosphate by a PTP can be monitored by an increase in absorbance at
405 nm. The dephosphorylation of a phosphorylated peptide can be monitored by
measuring phosphate release by the malachite green method.

Cell-based Assays

Compounds can also be screened in cell-based assays, specific to the protein 
tyrosine phosphatase of interest. 

Compounds that affect PTP1B can be screened using assays that monitor the effects
of insulin on cells. Examples of such assays are as follows:

Cells such as 3T3-L1 can be differentiated into adipocytes and induced to be insulin
resistant. The effect of compounds on glucose transport into these cells can be
monitored by measuring the rate of uptake of 2-[3H]deoxyglucose when the cells
are stimulated by insulin.

Another cell-based assay that can be used to monitor effects of compounds on
PTP1B is an assessment of insulin receptor tyrosine kinase activity. In this case the
tyrosine kinase activity of the insulin receptor captured from cells, e.g. 3T3-L1,
treated with test compounds is measured. Tyrosine kinase activity can be measured
using a peptide substrate and [γ-32P]-ATP.

A further assay involves monitoring the tyrosine phosphorylation of insulin
signalling molecules in cells that have been treated with insulin and test
compounds. Phosphorylation of molecules can be detected for example by Western
blotting cell extracts using monoclonal antibodies (Bleasdale et al., 2001,
Biochemistry, 40, 5642-5654).

Preferred compounds of the invention are those that have IC50 values in a cellular
assay of less than 1 µM, more preferably less than 100 nM and most preferably less
than 10 nM.

Pharmaceutical Formulations 

The compounds of the invention can be presented in the form of pharmaceutical
compositions.

In another aspect, therefore, the invention provides a pharmaceutical composition
comprising a compound that binds to protein tyrosine phosphatase sulfenyl amide
to prevent or inhibit conversion of the protein tyrosine phosphatase sulfenyl amide
to an active reduced form of the protein tyrosine phosphatase.

The pharmaceutical compositions can be in any form suitable for oral, parenteral,
topical, intranasal, intra-articular, ophthalmic, otic, rectal, intra-vaginal, or
transdermal administration. Where the compositions are intended for parenteral
administration, they can be formulated for intravenous, intramuscular or
subcutaneous administration.

Pharmaceutical dosage forms suitable for oral administration include tablets,
capsules, caplets, pills, lozenges, syrups, solutions, powders, granules, elixirs and
suspensions, sublingual tablets, wafers or patches and buccal patches.

Pharmaceutical compositions containing compounds of the invention can be
formulated in accordance with known techniques, see for example Remington's
Pharmaceutical Sciences, Mack Publishing Company, Easton, PA, USA.

Thus, tablet compositions can contain a unit dosage of active compound together
with an inert diluent or carrier such as a sugar or sugar alcohol, e.g. lactose, sucrose,
sorbitol or mannitol; and/or a non-sugar derived diluent such as sodium carbonate,
calcium phosphate, calcium carbonate, or a cellulose or derivative thereof such as
methyl cellulose, ethyl cellulose, hydroxypropyl methyl cellulose, and starches such
as corn starch. Tablets may also contain such standard ingredients as binding and
granulating agents such as polyvinylpyrrolidone, disintegrants (e.g.
swellable crosslinked polymers such as crosslinked carboxymethylcellulose),
lubricating agents (e.g. stearates), preservatives (e.g. parabens), antioxidants (e.g.
BHT), buffering agents (for example phosphate or citrate buffers), and effervescent
agents such as citrate/bicarbonate mixtures. Such excipients are well known and do
not need to be discussed in detail here.

Capsule formulations may be of the hard gelatin or soft gelatin variety and can
contain the active component in solid, semi-solid, or liquid form. Gelatin capsules
can be formed from animal gelatin or synthetic or plant derived equivalents thereof.

The solid dosage forms (e.g. tablets, capsules etc.) can be coated or un-coated, but
typically have a coating, for example a protective film coating (e.g. a wax or
varnish) or a release controlling coating. The coating (e.g. a Eudragit (TM) type
polymer) can be designed to release the active component at a desired location
within the gastro-intestinal tract. Thus, the coating can be selected so as to degrade
under certain pH conditions within the gastrointestinal tract, thereby selectively
releasing the compound in the stomach or in the ileum or duodenum.

Instead of, or in addition to, a coating, the drug can be presented in a solid matrix
comprising a release controlling agent, for example a release delaying agent which
may be adapted to selectively release the compound under conditions of varying
acidity or alkalinity in the gastrointestinal tract. Alternatively, the matrix material
or release retarding coating can take the form of an erodible polymer (e.g. a maleic
anhydride polymer) which is substantially continuously eroded as the dosage form
passes through the gastrointestinal tract.

Compositions for topical use include ointments, creams, sprays, patches, gels, 
liquid drops and inserts (for example intraocular inserts). Such compositions can be 
formulated in accordance with known methods. 

Compositions for parenteral administration are typically presented as sterile
aqueous or oily solutions or fine suspensions, or may be provided in finely divided
sterile powder form for making up extemporaneously with sterile water for
injection.

Examples of formulations for rectal or intra-vaginal administration include
pessaries and suppositories which may be, for example, formed from a shaped
moldable or waxy material containing the active compound.

Compositions for administration by inhalation may take the form of inhalable
powder compositions or liquid or powder sprays, and can be administered in
standard form using powder inhaler devices or aerosol dispensing devices. Such
devices are well known. For administration by inhalation, the powdered
formulations typically comprise the active compound together with an inert solid
powdered diluent such as lactose.

The compounds of the invention will generally be presented in unit dosage form
and, as such, will typically contain sufficient compound to provide a desired level
of biological activity. For example, a formulation intended for oral administration
may contain from 0.1 milligrams to 2 grams of active ingredient, more usually from
10 milligrams to 1 gram, for example, 50 milligrams to 500 milligrams.

The active compound will be administered to a patient in need thereof (for example
a human or animal patient) in an amount sufficient to achieve the desired
therapeutic effect.

In another aspect, the invention provides a method of preparing a composition
comprising (a) identifying a PTP sulfenyl amide modulator molecule (which
may thus be termed a lead compound) by a method of any one of the other aspects
of the invention disclosed herein; and (b) admixing the molecule with a carrier.

Also provided is a method of preparing a composition comprising (a) identifying a
PTP sulfenyl amide modulator molecule (which may thus be termed a lead
compound) by a method of any one of the other aspects of the invention disclosed
herein; and (b) admixing an optimised structure of the modulator molecule with a
carrier.

The invention further provides a process for producing a medicament,
pharmaceutical composition or drug, the process comprising: (a) identifying a
PTP sulfenyl amide modulator molecule (which may thus be termed a lead
compound) by a method of any one of the other aspects of the invention disclosed
herein; and (b) preparing a medicament, pharmaceutical composition or drug
containing the modulator molecule.

A further aspect of the present invention provides a method for preparing a
medicament, pharmaceutical composition or drug, the method comprising:
(a) identifying a PTP sulfenyl amide modulator molecule (which may thus be
termed a lead compound) by a method of any one of the other aspects of the
invention disclosed herein; (b) optimising the structure of the modulator molecule;
and (c) preparing a medicament, pharmaceutical composition or drug containing the
optimised modulator molecule.

The above-described processes of the invention may be iterative in that the
modified compound may itself be the basis for further compound design. Detailed
structural information can be obtained about the binding of the candidate modulator
to PTP sulfenyl amide, and in the light of this information adjustments can be made
to the structure or functionality of the candidate modulator, e.g. to improve binding
to the binding cavity or cavities. The above steps may be repeated as often
as necessary.

By "optimising the structure" we mean e.g. adding molecular scaffolding, adding or
varying functional groups, or connecting the molecule with other molecules (e.g.
using a fragment linking approach) such that the chemical structure of the
modulator molecule is changed while its original modulating functionality is
maintained or enhanced. Such optimisation is regularly undertaken during drug
development programmes to e.g. enhance potency, promote pharmacological
acceptability, increase chemical stability etc. of lead compounds.

Modifications typically will be those conventional in the art known to the skilled
medicinal chemist, and will include, for example, substitutions or removal of
groups containing residues which interact with the amino acid side chain groups of
a PTP sulfenyl amide structure of the invention. For example, the replacements
may include the addition or removal of groups in order to decrease or increase the
charge of a group in a test compound, the replacement of a charged group with a
group of the opposite charge, or the replacement of a hydrophobic group with a
hydrophilic group or vice versa. It will be understood that these are only examples
of the type of substitutions considered by medicinal chemists in the development of
new pharmaceutical compounds and other modifications may be made, depending
upon the nature of the starting compound and its activity.

Methods of Treatment

It is envisaged that the compounds of the invention will be useful in the prophylaxis
or treatment of a range of disease states or conditions mediated by protein tyrosine
phosphatases. Examples of such disease states and conditions are set out above and
include cancers, diabetes (diabetes type I and II), obesity,
autoimmune diseases, acute and chronic inflammation, rheumatoid arthritis,
osteoporosis, proliferative disorders including various forms of cancer, growth
disorders and hypertension.

Compounds of the invention are generally administered to a subject in need of such
administration, for example a human or animal patient, preferably a human.

The compounds will typically be administered in amounts that are therapeutically
or prophylactically useful and which generally are non-toxic. However, in certain
situations (for example in the case of life threatening diseases), the benefits of
administering a compound of the invention may outweigh the disadvantages of any
toxic effects or side effects, in which case it may be considered desirable to
administer compounds in amounts that are associated with a degree of toxicity.

A typical daily dose of the compound can be in the range from 100 picograms to
100 milligrams per kilogram of body weight, more typically 10 nanograms to 10
milligrams per kilogram of body weight, although higher or lower doses may be
administered where required. Ultimately, the quantity of compound administered
will be commensurate with the nature of the disease or physiological condition
being treated and will be at the discretion of the physician.
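
Because the stated ranges span many orders of magnitude (picograms to milligrams),
a short unit-conversion sketch may help in reading them. It is purely arithmetical
and is not dosing guidance; the constant and function names and the 70 kg body
weight are hypothetical assumptions.

```python
PG_PER_MG = 10**9  # picograms in one milligram

# Daily dose ranges from the text, expressed in picograms per kilogram.
BROAD_RANGE_PG_PER_KG = (100, 100 * PG_PER_MG)           # 100 pg to 100 mg
TYPICAL_RANGE_PG_PER_KG = (10 * 1000, 10 * PG_PER_MG)    # 10 ng to 10 mg


def daily_dose_range_mg(body_weight_kg, range_pg_per_kg):
    """Convert a per-kilogram range into total milligrams per day for one subject."""
    low, high = range_pg_per_kg
    return (low * body_weight_kg / PG_PER_MG, high * body_weight_kg / PG_PER_MG)


if __name__ == "__main__":
    # Hypothetical 70 kg subject.
    print(daily_dose_range_mg(70, TYPICAL_RANGE_PG_PER_KG))
```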

The compounds of the invention can be administered as the sole therapeutic agent
or they can be administered in combination therapy with one or more other
compounds for treatment of a particular disease state, for example a neoplastic
disease such as a cancer as hereinbefore defined.

EXAMPLES 

The invention will now be illustrated in greater detail by reference to the specific 
embodiments described in the following non-limiting examples. 



EXAMPLE 1

PTP1B Expression, Purification, Crystallisation and Structure Determination

Expression, purification and crystallisation of the catalytic domain of PTP1B
(residues 1-321) were based on literature conditions (see Barford et al., the
entire disclosure of which is incorporated herein by reference).

PTP1B Expression, Purification and Crystallization Protocol

Using the DNA sequence of human PTP1B (GenBank NM_002827), a fragment
encoding the N-terminal 321 residues was generated and cloned into the expression
vector pET19b (Novagen) at the NcoI site, enabling the initiation of translation at
Met1. Primers used to generate the plasmid were:

5'-TTTTCCATGGAGATGGAAAAGGAGTTCG-3' (SEQ. ID. NO: 1)

5'-TTTTCCATGGCTAATTGTGTGGCTCCAGGATTCG-3' (SEQ. ID. NO: 2)
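
As a quick sanity check on the two primers, the NcoI recognition sequence
(CCATGG) can be located in each, and the second primer can be reverse-complemented
to read it in the coding orientation. This is a minimal sketch with hypothetical
names; it only inspects the primer strings and makes no claim about the cloning
procedure itself.

```python
NCOI_SITE = "CCATGG"  # NcoI recognition sequence; it contains an ATG start codon

FORWARD_PRIMER = "TTTTCCATGGAGATGGAAAAGGAGTTCG"        # SEQ. ID. NO: 1
REVERSE_PRIMER = "TTTTCCATGGCTAATTGTGTGGCTCCAGGATTCG"  # SEQ. ID. NO: 2


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(sequence))


if __name__ == "__main__":
    for name, primer in (("forward", FORWARD_PRIMER), ("reverse", REVERSE_PRIMER)):
        print(name, "NcoI site at index", primer.find(NCOI_SITE))
    # In the coding orientation the reverse primer places a TAG stop codon
    # immediately before its NcoI site.
    print(reverse_complement(REVERSE_PRIMER))
```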

E. coli BL21(DE3) cells transformed with pET19b-PTP1B were grown overnight at
37°C in LB medium plus 100 µg ampicillin/ml. Typically, 10 ml of this overnight
culture was used to inoculate 1 litre of LB plus 100 µg ampicillin/ml. Cultures were
grown at 37°C for 3 hours prior to induction by addition of
isopropyl-thio-β-D-galactopyranoside (IPTG) to a final concentration of 1 mM. The
cultures were grown for a further 3 hours before being harvested by centrifugation.

All purification steps were performed at 4°C and all buffers unless stated otherwise 
contained a 1/1000 dilution of protease inhibitor cocktail III (Calbiochem). 

Bacterial pellets were resuspended on ice in 20 mM imidazole, pH 7.5, 1 mM EDTA,
3 mM DTT, 10% (v/v) glycerol and lysed by sonication (2 mins, 20 second pulses).
The lysed cells were incubated with DNase I (Sigma) for 10 minutes at 4°C.
Following this, the lysate was clarified by centrifugation at 25,000 rpm for 30
minutes. Protein was applied to a Q-Sepharose Fast Flow column incorporated into
an AKTA FPLC system (Amersham Biosciences) at a flow rate of 4 ml/min. A linear
salt gradient (0-0.5 M NaCl in 20 mM imidazole, pH 7.5, 1 mM EDTA, 3 mM DTT, 10%
(v/v) glycerol) was applied to the column and PTP1B eluted at ~300 mM NaCl.
Fractions containing PTP1B were pooled and buffer exchanged using a 26/10
desalting column (Amersham Biosciences) pre-equilibrated in 25 mM NaH2PO4 pH
6.5, 1 mM EDTA, 3 mM DTT, 10% (v/v) glycerol. PTP1B fractions were applied to
a 10/10 Mono S column previously equilibrated in 25 mM NaH2PO4 pH 6.5, 1 mM
EDTA, 3 mM DTT, 10% (v/v) glycerol. Protein fractions were eluted by applying a
linear salt gradient (0-0.5 M NaCl) to the column with PTP1B eluting at ~75 mM
NaCl. PTP1B was buffer exchanged into 10 mM Tris pH 7.5, 25 mM NaCl, 0.2 mM
EDTA and 3 mM DTT. Protein purity was assessed by SDS PAGE and was
observed to be >95% pure. This protein was subsequently concentrated to 10 mg/ml
and used for crystallisation.

Using the hanging drop method (Crystallization of Nucleic Acids and Proteins: A
Practical Approach. A. Ducruix and R. Giegé. Oxford University Press 1999; and
Principles of Protein X-ray Crystallography. Jan Drenth. Springer Verlag 1994),
PTP1B crystals were grown at 4°C from 4 µl drops (1:1 ratio protein:reservoir
solution) and equilibrated against 1 ml reservoir solution consisting of 12-18% (v/v)
PEG 4000, 0.1 M HEPES (pH 7.5) and 0.2 M magnesium acetate.

Method for assaying PTP1B activity:

PTP1B enzyme activity was assayed by measuring the dephosphorylation of
p-nitrophenyl phosphate to p-nitrophenol. The reaction was monitored by following
the increase in absorbance at 405 nm as p-nitrophenol was produced. Standard
assays typically contained 0.25 mM p-nitrophenyl phosphate substrate and 25 nM
PTP1B enzyme in 50 mM HEPES, pH 6.5, 1 mM DTT, 1 mM EDTA, 0.01%
CHAPS buffer. Assays were carried out in Costar 3696 half-area plates in an assay
volume of 100 µl on a Spectramax plate reader (Molecular Devices). Reactions
were typically monitored at 20-second intervals for 30 minutes. For kinetic
measurements of Km, the concentration of the p-nitrophenyl phosphate substrate was
varied between 31 µM and 5 mM.
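
As a minimal sketch of how Km might be estimated from such a substrate titration, the
following Python fits initial rates to the Michaelis-Menten equation using the
Hanes-Woolf linearisation ([S]/v = [S]/Vmax + Km/Vmax). Only the end points of the
substrate range (31 µM and 5 mM) come from the text above; the intermediate
concentrations, the rates (synthetic and noise-free) and all function names are
assumptions made for illustration and do not represent measured data or the fitting
procedure actually used.

```python
def michaelis_menten(s, km, vmax):
    """Initial rate predicted by the Michaelis-Menten equation."""
    return vmax * s / (km + s)


def hanes_woolf_fit(substrate, rates):
    """Estimate (Km, Vmax) from a least-squares line through [S]/v versus [S].

    Hanes-Woolf form: [S]/v = [S]/Vmax + Km/Vmax,
    so slope = 1/Vmax and intercept = Km/Vmax.
    """
    ys = [s / v for s, v in zip(substrate, rates)]
    n = len(substrate)
    mean_x = sum(substrate) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in substrate)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(substrate, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax
    return km, vmax


# Substrate concentrations in mM; only the 0.031 and 5 mM end points come from
# the text, the intermediate points are assumed.
substrate_mM = [0.031, 0.1, 0.3, 1.0, 2.5, 5.0]

# Synthetic, noise-free rates from assumed parameters (illustration only).
ASSUMED_KM_mM = 1.0
ASSUMED_VMAX = 2.0  # arbitrary units, e.g. absorbance change per minute
rates = [michaelis_menten(s, ASSUMED_KM_mM, ASSUMED_VMAX) for s in substrate_mM]

km_fit, vmax_fit = hanes_woolf_fit(substrate_mM, rates)
print(f"Km = {km_fit:.3f} mM, Vmax = {vmax_fit:.3f}")
```

With real plate-reader data, the rates would be the initial slopes of the 405 nm
absorbance traces; a direct non-linear fit is generally preferred when the data are
noisy, since linearised plots weight the errors unevenly.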

Activity of PTP1B was also monitored by measuring the phosphate produced in the
reaction. Standard assays were set up as described above, except that 50 µl of
malachite green reagent was added to quench the reaction and the absorbance at
620 nm was then measured on a Spectramax plate reader (Molecular Devices) to
determine the amount of phosphate produced. Malachite green reagent is made
from 0.2% w/v malachite green and 4.2% w/v ammonium molybdate mixed in a
ratio of 3:1.

For overnight soaking experiments, crystals were transferred into a standard mother
liquor containing 16% PEG 4000, 0.1 M HEPES (pH 7.5), 0.2 M magnesium
acetate, 10 mM DTT and the compound of interest (see Table 3 below).

All data were collected at 100 K. Data sets of the sulfenic, sulfinic and sulfonic acid
PTP1B derivatives were collected automatically on a Jupiter 140 CCD detector
mounted on an RU3HR rotating anode generator equipped with an ACTOR
sample-changing robot (Rigaku MSC, The Woodlands, TX, USA). They were processed and
scaled using D*TREK and converted to structure factors using programs from the
CCP4 suite.

Oxidation of Cys215 to the sulfenyl-amide derivative was carried out by soaking
crystals for approximately 24 hours in mother liquor without DTT, but containing
100 mM 2-phenyl-isoxazolidine-3,5-dione. Data were collected at station 14.1
(SRS, Daresbury) and processed and scaled using MOSFLM/SCALA^22.

Reversibility of the sulfenyl-amide derivative was checked by first soaking two
crystals in 100 mM 2-phenyl-isoxazolidine-3,5-dione for 24 hours. Sulfenyl-amide
formation was confirmed by an in-house data set of one of the crystals. After back
soaking for 24 hours in mother liquor containing 20 mM reduced glutathione, data
collected from the other crystal confirmed the active site in its native conformation.

All results were evaluated using the graphics programs QUANTA (Accelrys, San
Diego, CA, USA) and AstexViewer™. Data collection statistics are summarised
in Table 2. Initial refinement was always carried out using the CCP4-based Astex
automatic refinement scripts, followed by rounds of positional and B-factor
refinement with REFMAC5 alternated with manual rebuilding steps using
QUANTA.



Monomer libraries for the different oxidation states were generated using
REFMAC5 in combination with the monomer sketcher of the CCP4GUI^22.



Table 2: Crystallographic data collection and refinement statistics

                          Sulfenic       Sulfinic       Sulfonic       Sulfenyl-      Back soaking experiment
                          acid           acid           acid           amide          Compound       Glutathione

Data collection
Beamline                  In-house       In-house       In-house       SRS 14.1       In-house       In-house
λ (Å)                     1.54           1.54           1.54           1.488          1.54           1.54
Resolution (Å)            2.3            2.6            2.2            2.2            2.4            2.2
No. Observations          43942          29624          53452          83434          103350         108793
reflections                              162^7          23367          23591          18514          23322
Completeness (%)*         96.0 (94.9)    95.9 (95.3)    96.0 (86.9)    98.9 (97.4)    99.8 (100)     100 (100)
Rmerge*                   0.092 (0.25)   0.172 (0.37)   0.060 (0.252)  0.050 (0.26)   0.078 (0.273)  0.054 (0.32)
I/σ<I>*                   7.3 (2.8)      4.6 (1.9)      11.2 (4.3)     7.8 (2.8)      8.4 (2.8)      12.9 (2.2)

Refinement
Rcryst/Rfree              0.187/0.248    0.204/0.263    0.202/0.242    0.243/0.283    0.215/0.275    0.228/0.277
Rmsd bond lengths (Å)     0.013          0.018          0.012          0.009          0.006          0.006
Rmsd bond angles (°)      1.4            1.7            1.3            1.4            1.3            1.3

*Numbers in parentheses indicate values for the highest-resolution shell.

$R_{merge} = \sum_h \sum_i |I(h,i) - \langle I \rangle(h)| / \sum_h \sum_i \langle I \rangle(h)$;
$I(h,i)$ is the scaled intensity of the ith observation of reflection h and
$\langle I \rangle(h)$ is the mean value. Summation is over all measurements.

$R_{cryst} = \sum_{hkl,work} ||F_{obs}| - k|F_{calc}|| / \sum_{hkl} |F_{obs}|$, where
$F_{obs}$ and $F_{calc}$ are the observed and calculated structure factors, k is a
weighting factor and work denotes the working set of 95% of the reflections used
in the refinement.

$R_{free} = \sum_{hkl,test} ||F_{obs}| - k|F_{calc}|| / \sum_{hkl} |F_{obs}|$, where
$F_{obs}$ and $F_{calc}$ are the observed and calculated structure factors, k is a
weighting factor and test denotes the test set of 5% of the reflections used in
cross validation of the refinement.

λ refers to wavelength, Rmsd to root mean square deviation.
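
The residuals defined in the footnotes above can be illustrated with a short Python
sketch. It assumes that the Rcryst and Rfree denominators run over the same
reflection set as their numerators, and it uses small invented intensities and
structure-factor amplitudes purely to show the arithmetic; the function names and
data layout are hypothetical and are not taken from D*TREK, SCALA or REFMAC5.

```python
def r_merge(observations):
    """R_merge over repeated measurements.

    observations maps a reflection index h to the list of its scaled
    intensities I(h, i). The denominator sums the mean intensity <I>(h)
    once per observation, following the footnote definition.
    """
    numerator = 0.0
    denominator = 0.0
    for intensities in observations.values():
        mean_i = sum(intensities) / len(intensities)
        numerator += sum(abs(i - mean_i) for i in intensities)
        denominator += mean_i * len(intensities)
    return numerator / denominator


def r_factor(f_obs, f_calc, k=1.0):
    """Residual between observed and calculated amplitudes for one reflection set.

    Applied to the working set this gives R_cryst; applied to the
    held-out test set it gives R_free.
    """
    numerator = sum(abs(abs(fo) - k * abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    return numerator / denominator


# Invented example values (not data from Table 2).
example_observations = {
    (1, 0, 0): [100.0, 110.0, 90.0],
    (0, 1, 0): [50.0, 50.0],
}
example_f_obs = [10.0, 20.0, 30.0]
example_f_calc = [9.0, 22.0, 30.0]

print(r_merge(example_observations))
print(r_factor(example_f_obs, example_f_calc))
```

Splitting the reflections into a 95% working set and a 5% test set before refinement,
and calling `r_factor` on each set separately, would give the two values reported in
the Rcryst/Rfree row.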



TABLE 3

Compounds Used in Soaking Experiments

Structure of 2-phenyl-isoxazolidine-3,5-dione:

[chemical structure not reproduced]

Composition of cocktails used in soaking experiments:

Cocktail composition used in the soaking experiment resulting in the sulfenic acid
(Cys-SOH) PTP1B derivative:

[chemical structures not reproduced]

Cocktail composition used in the soaking experiment resulting in the sulfinic acid
(Cys-SO2H) PTP1B derivative:

[chemical structures not reproduced]

Cocktail composition used in the soaking experiment resulting in the sulfonic acid
(Cys-SO3H) PTP1B derivative:

[chemical structures not reproduced]

PHARMACEUTICAL FORMULATIONS 

EXAMPLE 2
(i) Tablet Formulation

A tablet composition containing a compound of the invention is prepared by mixing
50 mg of the compound with 197 mg of lactose (BP) as diluent, and 3 mg
magnesium stearate as a lubricant and compressing to form a tablet in known
manner.

(ii) Capsule Formulation

A capsule formulation is prepared by mixing 100 mg of a compound of the
invention with 100 mg lactose and filling the resulting mixture into standard opaque
hard gelatin capsules.
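
The per-unit quantities of the tablet formulation above can be scaled to a batch with
simple arithmetic. The Python below is a minimal sketch of that calculation; the batch
size of 1000 tablets and the function names are assumptions chosen for illustration,
and no allowance is made for processing losses.

```python
# Per-tablet quantities in mg, taken from the tablet formulation above.
TABLET_FORMULA_MG = {
    "compound": 50.0,
    "lactose (BP)": 197.0,
    "magnesium stearate": 3.0,
}


def weight_percent(unit_formula_mg):
    """Return each ingredient as a percentage of the total unit weight."""
    total = sum(unit_formula_mg.values())
    return {name: 100.0 * mg / total for name, mg in unit_formula_mg.items()}


def batch_quantities_g(unit_formula_mg, n_units):
    """Return the grams of each ingredient needed for n_units dosage units."""
    return {name: mg * n_units / 1000.0 for name, mg in unit_formula_mg.items()}


tablet_weight_mg = sum(TABLET_FORMULA_MG.values())
percentages = weight_percent(TABLET_FORMULA_MG)
batch = batch_quantities_g(TABLET_FORMULA_MG, 1000)  # assumed batch size

print(f"tablet weight: {tablet_weight_mg} mg")
for name in TABLET_FORMULA_MG:
    print(f"{name}: {percentages[name]:.1f}% w/w, {batch[name]:.0f} g per batch")
```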

Equivalents

It will readily be apparent that numerous modifications and alterations may be made
to the specific embodiments of the invention described above without departing
from the principles underlying the invention. All such modifications and alterations
are intended to be embraced by this application.
TABLE 1

Unit cell dimensions:

a = 87.686 Å, b = 87.686 Å, c = 103.721 Å, α = 90.00°, β = 90.00°,
γ = 120.00°

Space group:
P3₁21
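
As a small worked example, the unit cell volume implied by the dimensions above can
be computed from the general triclinic formula, which for α = β = 90° and γ = 120°
reduces to V = a²c·sin(120°). The Python below is an illustrative sketch only; the
function name is hypothetical and the calculation is not part of the original table.

```python
import math


def unit_cell_volume(a, b, c, alpha_deg, beta_deg, gamma_deg):
    """Volume of a unit cell (Å^3) from its edge lengths (Å) and angles (degrees)."""
    cos_a = math.cos(math.radians(alpha_deg))
    cos_b = math.cos(math.radians(beta_deg))
    cos_g = math.cos(math.radians(gamma_deg))
    # General triclinic expression; valid for every crystal system.
    return a * b * c * math.sqrt(
        1.0 - cos_a ** 2 - cos_b ** 2 - cos_g ** 2 + 2.0 * cos_a * cos_b * cos_g
    )


volume = unit_cell_volume(87.686, 87.686, 103.721, 90.0, 90.0, 120.0)
print(f"Unit cell volume: {volume:.0f} cubic angstroms")
```

For these dimensions the result is roughly 6.9 × 10^5 Å³.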
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47 


.040 
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28 
ATOM   1901  CD2 LEU A 232      30.443  21.492  46.345  1.00 32.86
ATOM   1902  N   LEU A 233      33.019  17.914  43.254  1.00 27.18
ATOM   1903  CA  LEU A 233      33.473  17.442  41.957  1.00 28.48
ATOM   1904  C   LEU A 233      32.429  16.527  41.298  1.00 29.51
ATOM   1905  O   LEU A 233      32.143  16.664  40.110  1.00 27.14
ATOM   1906  CB  LEU A 233      34.809  16.741  42.110  1.00 30.57
ATOM   1907  CG  LEU A 233      35.788  16.847  40.964  1.00 35.34
ATOM   1908  CD1 LEU A 233      36.159  18.310  40.635  1.00 39.45
ATOM   1909  CD2 LEU A 233      37.003  16.021  41.327  1.00 39.50
ATOM   1910  N   LEU A 234      31.830  15.630  42.077  1.00 26.58
ATOM   1911  CA  LEU A 234      30.833  14.711  41.555  1.00 29.13
ATOM   1912  C   LEU A 234      29.574  15.443  41.127  1.00 29.39
ATOM   1913  O   LEU A 234      28.915  15.025  40.190  1.00 29.61
ATOM   1914  CB  LEU A 234      30.463  13.643  42.589  1.00 28.81


ATOM 
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CG 


LEU 


A 


234 


31. 


527 


12 


.619 


42 


.998 
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00 


29 


.20 


ATOM 


1916 


CDl 


LEU 


A 


234 


31. 


822 


11 


.669 


41 


.849 
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00 


32 


.34 


ATOM 


1917 


CD 2 
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A 


234 


31 . 


057 


11 


. 839 


44 


.210 


1 . 


00 


26 


. 85 


ATOM 
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MET 


A 


235 


29 . 


227 


16 


. 522 


41 


. 826 
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00 


33 


. 19 
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235 


28 . 


057 


17 


.314 


41 


.464 
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00 


33 


. 17 
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MET 


A 


235 


28 . 


224 


17 
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40 


. 088 
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00 


34 


. 16 


ATOM 


1921 
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235 


27 . 


267 


18 


. 060 


39 


.335 
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00 


37 


.58 
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CB 


MET 
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235 


27 . 


752 


18 


. 354 


42 


. 533 


1 . 


00 


33 


.08 


ATOM 


1923 


CG 


MET 


A 


235 


27 . 


218 


17 


. 733 


43 


. 794 


1 . 


00 


37 


. 93 


ATOM 


1924 


SD 


MET 


A 


235 


26 . 
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18 


. 903 


45 


. 116 


1 . 


00 


38 


. 83 


ATOM 


1925 


CE 


MET 


A 


235 


26 . 


505 


17 


. 871 


46 


. 353 
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00 


40 


.67 


ATOM 


1926 


N 


ASP 


A 


236 


29 . 


443 


18 


. 378 


39 


- 760 


1 . 


00 


34 


.42 


ATOM 


1927 


CA 
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A 


236 


29 . 


801 


18 


. 860 


38 


.424 


1. 


00 


35 


.78 


ATOM   1928  C   ASP A 236      29.906  17.793  37.323  1.00 37.28
ATOM   1929  O   ASP A 236      29.743  18.119  36.154  1.00 36.30
ATOM   1930  CB  ASP A 236      31.166  19.551  38.467  1.00 40.86
ATOM   1931  CG  ASP A 236      31.060  21.045  38.568  1.00 45.84
ATOM   1932  OD1 ASP A 236      30.172  21.666  37.909  1.00 49.22
ATOM   1933  OD2 ASP A 236      31.855  21.681  39.276  1.00 49.02
ATOM   1934  N   LYS A 237      30.232  16.553  37.686  1.00 36.32
ATOM   1935  CA  LYS A 237      30.475  15.475  36.723  1.00 35.45
ATOM   1936  C   LYS A 237      29.181  14.789  36.283  1.00 33.73
ATOM   1937  O   LYS A 237      28.992  14.476  35.105  1.00 28.67
ATOM   1938  CB  LYS A 237      31.404  14.422  37.347  1.00 36.84
ATOM   1939  CG  LYS A 237      32.262  13.614  36.365  1.00 41.35
ATOM   1940  CD  LYS A 237      33.411  12.887  37.086  1.00 41.67
ATOM   1941  CE  LYS A 237      34.476  12.316  36.122  1.00 47.97
ATOM   1942  NZ  LYS A 237      35.847  13.013  36.172  1.00 45.85
ATOM   1943  N   ARG A 238      28.299  14.534  37.237  1.00 32.18
ATOM   1944  CA  ARG A 238      27.157  13.655  36.993  1.00 34.57
ATOM   1945  C   ARG A 238      26.034  14.365  36.262  1.00 35.08
ATOM   1946  O   ARG A 238      25.798  15.562  36.470  1.00 32.96


ATOM 


1947 


CB 


ARG 


A 


238 


26 . 


584 


13 


. 143 


38 


. 303 
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00 


35 


.62 


ATOM 


1948 


CG 
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238 


27 . 


366 


12 


. 041 


38 


. 912 


1. 


00 


33 


.98 


ATOM 


1949 


CD 


ARG 


A 


238 


26 . 


764 


11 


. 583 


40 


. 190 
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00 


32 


. 14 


ATOM   1950  NE  ARG A 238      27.503  10.469  40.780  1.00 29.90
ATOM   1951  CZ  ARG A 238      27.108   9.846  41.875  1.00 31.16


ATOM 


1952 


NHl 
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A 


238 


26. 


009 


10 


.228 


42 


.496 
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00 


36 


. 13 


ATOM 


1953 


NH2 


ARG 


A 


238 


27, 


833 


8 


. 857 


42 


.383 
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00 


37 


.35 


ATOM 
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A 


239 


25. 


327 


13 
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35 


.452 
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00 


34 


.64 


ATOM 
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239 
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099 


14 


.019 


34 


.795 
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00 


37 


.99 


ATOM 


1956 


C 


LYS 
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239 


23 . 


025 
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35 
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00 


37 


.55 


ATOM 


1957 
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A 


239 
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217 


15 


.277 


35 


.558 
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00 


35 


.27 


ATOM 
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CB 


LYS 


A 


239 
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591 


12 


.917 


33 


. 857 
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00 


39 
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CG 


LYS 


A 


239 


24 


.489 


12 


. 698 


32 


. 626 


1 
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45 


. 02 


ATOM   1960  CD  LYS A 239      24.059  11.472  31.773  1.00 52.00
ATOM   1961  CE  LYS A 239      23.645  11.832  30.330  1.00 54.36


ATOM 
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NZ 


LYS 


A 


239 


24 


. 672 


11 
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29 


.329 


1 


.00 


57 


.23 


ATOM 


1963 


N 


ASP 


A 


240 


23 


. 042 


13 


. 721 


36 


. 962 


1 


.00 


39 


. 09 


ATOM 
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A 


240 


22 


. 107 


13 


. 970 


38 


. 057 


1 


.00 


39 


.45 


ATOM 


1965 
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ASP 


A 


240 


22 


. 853 


14 


.406 


39 


. 351 


1 
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36 


.92 


ATOM 


1966 


Q 


ASP 


A 


240 


23 


. 147 


13 


. 578 


40 


.211 


1 


.00 


38 


.63 


ATOM 
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CB 


ASP 


A 


240 


21 


.308 


12 


. 688 


38 


.291 


1 


. 00 


39 


.46 


ATOM 
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CG 
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A 


240 


20 


.200 


12 


. 847 


39 


. 313 
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.00 


39 


.85 


ATOM 


1969 
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240 


19 


. 876 


13 


. 993 


39 


. 740 
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.00 


38 


.51 


ATOM 
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0D2 
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240 


19 


. 594 


11 


. 843 


39 


. 742 


1 


. 00 


42 


. 08 
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PRO 


A 


241 


23 
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15 
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39 


.479 
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36 


.36 
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A 


241 


23 


. 862 


16 


.215 


40 


. 674 


1 


. 00 


37 


.38 


ATOM 


1973 
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PRO 


A 


241 


23 


.215 


15 


. 841 


42 


. 022 


1 


.00 


39 


.56 


ATOM   1974  O   PRO A 241      23.926  15.723  43.021  1.00 39.31
ATOM   1975  CB  PRO A 241      23.785  17.740  40.492  1.00 37.29


ATOM 
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CG 


PRO 


A 


241 


23 


. 527 


17 


, 971 


39 


. 061 


1 


. 00 


37 


.23 


ATOM 
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PRO 


A 


241 


22 


. 932 


16 
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38 


.475 


1 


. 00 


37 


. 85 


ATOM 
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SER 


A 


242 


21 


. 893 


15 


. 666 


42 


. 043 
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38 


.69 
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242 


21 
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15 
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.282 
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37 


.48 
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242 
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. 00 


35 
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20 
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13 
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44 
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242 
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. 652 


15 


. 764 


43 
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41 


. 14 


ATOM 
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OG 


SER 


A 


242 


18 


. 980 


14 
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42 


. 410 


1 


. 00 


43 


.22 


ATOM   1984  N   SER A 243      21.821  13.121  42.943  1.00 34.07
ATOM   1985  CA  SER A 243      22.116  11.752  43.336  1.00 37.77
ATOM   1986  C   SER A 243      23.381  11.645  44.197  1.00 36.66
ATOM   1987  O   SER A 243      23.640  10.589  44.763  1.00 40.77
ATOM   1988  CB  SER A 243      22.270  10.852  42.096  1.00 38.01


ATOM 


1989 
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SER 


A 


243 


23 


.450 


11 


. 164 


41 


.363 


1 


.00 


40 


.41 


ATOM 


1990 
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VAL 


A 


244 


24 


. 172 


12 


.717 


44 


. 266 


1 
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37 


. 13 


ATOM 


1991 


CA 
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244 


25 


. 385 


12 


. 747 


45 


. 092 


1 
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36 


. 10 


ATOM 


1992 
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A 


244 


24 


. 947 


12 


. 748 


46 


. 556 
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. 00 


35 


. 80 


ATOM 


1993 


o 
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A 


244 


24 


. 359 


13 


. 718 


47 


. 049 


1 


. 00 


32 


. 02 
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1994 


CB 
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A 


244 


26 


.303 


13 


. 972 


44 


. 773 
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36 


. 81 


ATOM 


1995 
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244 


27 
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14 


. 047 


45 
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39 


. 14 
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1996 
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244 
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. 804 


13 
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43 
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35 


. 55 
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1997 
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A 


245 


25 


. 225 
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47 


.220 
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. 84 
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1998 
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A 


245 
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11 
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1999 


c 


ASP 


A 


245 


26 
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49 


.471 
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34 
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2000 
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245 


26 
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10 
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49 
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1 


. 00 


36 
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A 


245 


24 
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48 


. 692 
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. 00 


40 
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245 


23 
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50 
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. 00 


43 


. 89 
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245 


23 
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10 


. 292 


51 


. 104 
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. 00 


46 


. 19 
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245 
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. 732 


8 
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. 00 


48 


. 65 
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ILE 
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. 268 
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. 00 


34 


.58 


ATOM 


2006 


CA 


ILE 
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246 
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51 


. 100 
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34 


.24 
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2 007 
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ILE 
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246 


27 


. 746 


11 
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52 


. 049 
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, 00 


34 


. 35 
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2008 
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ILE 


A 


246 


28 


.913 


11 


.284 


52 


. 334 


1 


. 00 


34 


.47 


ATOM   2009  CB  ILE A 246      27.297  14.049  51.866  1.00 36.01
ATOM   2010  CG1 ILE A 246      28.607  14.400  52.578  1.00 38.62
ATOM   2011  CG2 ILE A 246      26.156  14.011  52.868  1.00 40.14


ATOM 


2012 


CDl 


ILE 
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246 


28 


. 676 


15 


. 858 


53 


. 015 


1 


.00 


41 


.03 


ATOM 


2013 


N 


LYS 


A 


247 


26 


. 717 


10 


. 836 


52 


. 502 
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.00 


33 


. 97 


ATOM 


2014 


CA 


LYS 


A 


247 


26 


. 883 


9 


. 675 


53 


.372 
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35 


. 91 


ATOM 


2015 
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LYS 


A 


247 


27 


.561 
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.541 


52 
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34 


.60 


ATOM 


2016 
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A 


247 
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.478 


7 


.899 


53 


. 130 
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.00 


33 
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2017 


CB 


LYS 


A 


247 
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.533 


9 


.210 
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949 


1 . 


00 


37 


.68 


ATOM 


2018 


CG 
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A 


247 


24 


. 698 


10 


.317 


54 . 


636 
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00 


41 


. 97 
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2019 


CD 
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A 


247 


23 


.300 


9 


.831 


55. 


115 


1 . 


00 


45 


. 11 
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2020 


CE 
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247 
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.342 


9 


.488 
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965 
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00 


45 


.33 
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2021 


NZ 


LYS 
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247 
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00 
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248 
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00 
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00 
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248 
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00 
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010 
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00 
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248 
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7 
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49. 


300 
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00 


41 
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CG 
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248 
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.439 
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.441 


49. 


645 
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00 


47 


.35 


ATOM 


2028 
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248 
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. 879 


5 


.571 


48 . 


509 
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00 


51 


.20 
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CE 
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248 


23 
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5 
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48. 


712 
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00 


52 


.76 
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2030 


NZ 
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248 


22 
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6 


. 336 


48. 


233 
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00 


51 


.45 
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249 
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49. 


859 
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00 


34 
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249 
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49. 


505 
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00 


30 
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2033 


c 


VAL 


A 


249 
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00 
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249 
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00 
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CB 
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249 
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48 . 
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00 


28 


. 72 
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249 


32 
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11 
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48. 
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00 


32 


.77 
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2037 


CG2 
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A 


249 


30 


. 040 


10 


.918 


47. 


729 


1. 


00 


30 


.80 


ATOM 
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N 


LEU 
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250 


31 


.203 


9 
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51 . 


889 


1. 


00 


28 


.31 
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CA 
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250 


31 


. 950 


9 


. 355 


53 . 
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00 


30 


.22 
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2040 
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LEU 


A 


250 


32 


.248 


7 
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53 . 


379 
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00 


29 


.53 


ATOM 


2041 
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LEU 


A 
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. 750 


27 


. 202 


69 


. 951 


1. 


00 


46 


.42 


HETATM 2448  O   HOH Z 138      35.848  19.568  69.764  1.00 50.41
HETATM 2449  O   HOH Z 139      37.851  21.728  66.890  1.00 36.19
HETATM 2450  O   HOH Z 140      30.783  23.866  66.937  1.00 50.34
HETATM 2451  O   HOH Z 141      33.726  27.401  67.001  1.00 58.33
HETATM 2452  O   HOH Z 142      28.915  24.727  61.642  1.00 47.60
HETATM 2453  O   HOH Z 143      33.769  31.236  57.483  1.00 46.31
HETATM 2454  O   HOH Z 144      34.891  27.864  57.438  1.00 42.43
HETATM 2455  O   HOH Z 145      46.252  29.134  45.085  1.00 41.22
HETATM 2456  O   HOH Z 146      33.519  21.089  41.828  1.00 28.37
HETATM 2457  O   HOH Z 147      36.513  25.357  42.533  1.00 28.71
HETATM 2458  O   HOH Z 148      44.082  21.238  41.779  1.00 24.55
HETATM 2459  O   HOH Z 149      44.511  30.853  43.295  1.00 52.96
HETATM 2460  O   HOH Z 150      37.321  31.395  45.097  1.00 57.97
HETATM 2461  O   HOH Z 151      50.444  27.054  41.455  1.00 46.22
HETATM 2462  O   HOH Z 152      50.470  23.304  40.354  1.00 46.12
HETATM 2463  O   HOH Z 153      49.473  17.014  44.938  1.00 22.23
HETATM 2464  O   HOH Z 154      44.072  13.687  42.388  1.00 33.61
HETATM 2465  O   HOH Z 155      46.743  15.157  42.567  1.00 28.06
HETATM 2466  O   HOH Z 156      43.696  21.518  37.147  1.00 30.08
HETATM 2467  O   HOH Z 157      44.029  19.607  35.637  1.00 33.31
HETATM 2468  O   HOH Z 158      49.032  22.171  37.565  1.00 50.29
HETATM 2469  O   HOH Z 159      46.521  12.392  42.140  1.00 47.31
HETATM 2470  O   HOH Z 160      52.916  22.239  37.026  1.00 55.49
HETATM 2471  O   HOH Z 161      45.869  13.974  60.909  1.00 34.41
HETATM 2472  O   HOH Z 162      48.083  16.068  66.708  1.00 33.89
HETATM 2473  O   HOH Z 163      43.023  14.129  61.953  1.00 53.88
HETATM 2474  O   HOH Z 164      38.756  19.276  65.202  1.00 36.58
HETATM 2475  O   HOH Z 165      39.907  13.260  59.270  1.00 40.54
HETATM 2476  O   HOH Z 166      36.800  15.818  59.356  1.00 38.21


HETATM 


2477 


0 


HOH 


z 


167 


30 


.756 


21 


. 819 


41 


. 946 


1. 


00 


52 


.32 


HETATM 


2478 


o 


HOH 


z 


168 


26 


. 321 


18 


. 194 


36 


. 951 


1. 


00 


49 


.33 


HETATM 


2479 


o 


HOH 


z 


169 


28 


.298 


20 


.489 


35 


.444 


1. 


00 


47 


.56 


HETATM 


2480 


0 


HOH 


z 


170 


34 


.060 


21 


.200 


39 


.322 


1. 


00 


37 


.57 
HETATM 2481  O   HOH Z 171      31.024  17.705  33.835  1.00 31.46
HETATM 2482  O   HOH Z 172      35.364  14.356  33.358  1.00 50.31
HETATM 2483  O   HOH Z 173      37.090  14.498  37.690  1.00 47.58
HETATM 2484  O   HOH Z 174      29.883  11.426  33.922  1.00 37.75
HETATM 2485  O   HOH Z 175      30.389  15.363  32.808  1.00 37.18
HETATM 2486  O   HOH Z 176      25.224  10.459  26.647  1.00 58.79
HETATM 2487  O   HOH Z 177      26.104  10.611  35.365  1.00 35.22
HETATM 2488  O   HOH Z 178      19.926   9.452  39.387  1.00 55.25
HETATM 2489  O   HOH Z 179      19.723  16.723  39.659  1.00 52.43
HETATM 2490  O   HOH Z 180      19.071  11.323  42.461  1.00 51.41
HETATM 2491  O   HOH Z 181      24.087  10.832  37.321  1.00 35.20
HETATM 2492  O   HOH Z 182      23.535  16.129  45.715  1.00 43.59
HETATM 2493  O   HOH Z 183      21.411  12.400  47.935  1.00 51.82
HETATM 2494  O   HOH Z 184      26.338   9.343  45.940  1.00 38.29
HETATM 2495  O   HOH Z 185      28.937   4.006  49.860  1.00 46.67
HETATM 2496  O   HOH Z 186      35.592   1.182  45.532  1.00 43.21
HETATM 2497  O   HOH Z 187      42.181  12.293  59.682  1.00 44.13
HETATM 2498  O   HOH Z 188      40.677  11.462  62.985  1.00 35.66
HETATM 2499  O   HOH Z 189      34.830   8.329  59.655  1.00 45.13
HETATM 2500  O   HOH Z 190      39.290   9.584  68.605  1.00 52.23
HETATM 2501  O   HOH Z 191      29.697  13.317  68.291  1.00 52.85
HETATM 2502  O   HOH Z 192      24.761  18.714  66.413  1.00 59.04
HETATM 2503  O   HOH Z 193      29.424  16.578  68.004  1.00 52.70
HETATM 2504  O   HOH Z 194      28.068  22.230  62.043  1.00 49.54
HETATM 2505  O   HOH Z 195      26.825  29.550  56.806  1.00 54.77



END 
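
By way of illustration only, the coordinate records of Table 1 may be read programmatically. The following Python sketch assumes that each record is laid out in the standard fixed-column PDB format (an assumption, since the printed table does not preserve column positions); the names `AtomRecord` and `parse_atom_line` are hypothetical and form no part of the original disclosure.

```python
from typing import NamedTuple


class AtomRecord(NamedTuple):
    record: str       # "ATOM" or "HETATM"
    serial: int       # atom serial number
    name: str         # atom name, e.g. "O"
    res_name: str     # residue name, e.g. "HOH"
    chain: str        # chain identifier, e.g. "Z"
    res_seq: int      # residue sequence number
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float


def parse_atom_line(line: str) -> AtomRecord:
    """Parse one ATOM/HETATM line, assuming standard PDB column positions."""
    return AtomRecord(
        record=line[0:6].strip(),
        serial=int(line[6:11]),
        name=line[12:16].strip(),
        res_name=line[17:20].strip(),
        chain=line[21],
        res_seq=int(line[22:26]),
        x=float(line[30:38]),
        y=float(line[38:46]),
        z=float(line[46:54]),
        occupancy=float(line[54:60]),
        b_factor=float(line[60:66]),
    )


# Final water record of Table 1, written out in the assumed column layout.
example = "HETATM 2505  O   HOH Z 195      26.825  29.550  56.806  1.00 54.77"
water = parse_atom_line(example)
```

Reading by column position rather than by whitespace is chosen here because adjacent numeric fields in PDB files are permitted to touch one another.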
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TABLE 2 

All atoms as in Table 1 (incorporated herein by reference) except for the following 
six changes: 

1) Delete the lines for atoms 1770-1793.



2) Replace with: 



ATOM   1770  N   CYS A 215      45.982  16.650  59.851  1.00 21.29
ATOM   1771  CA  CYS A 215      45.553  17.584  60.885  1.00 21.43
ATOM   1772  C   CYS A 215      46.211  17.303  62.215  1.00 24.05
ATOM   1773  O   CYS A 215      47.355  16.913  62.307  1.00 22.32
ATOM   1774  CB  CYS A 215      44.037  17.520  61.166  1.00 22.73
ATOM   1775  SG  CYS A 215      43.916  18.029  62.855  1.00 30.96
ATOM   1776  N   SER A 216      45.432  17.553  63.271  1.00 28.96
ATOM   1777  CA  SER A 216      45.887  17.111  64.593  1.00 33.95
ATOM   1778  C   SER A 216      45.405  15.709  64.892  1.00 38.49
ATOM   1779  O   SER A 216      44.329  15.318  64.490  1.00 38.68
ATOM   1780  CB  SER A 216      45.451  18.035  65.735  1.00 34.57
ATOM   1781  OG  SER A 216      45.116  19.351  65.362  1.00 36.62



3) Renumber following atoms (from ALA 217) accordingly. 

4) Insert following line after new ATOM 2297 (old ATOM 2309): 

TER 2298 GLY A 283 



5) Insert the following header into the PDB file:



LINK        MG    MG A1282                 O   HOH Z 106     1555   4546
LINK        MG    MG A1282                 O   HOH Z  10     1555   1555
LINK        MG    MG A1282                 O   HOH Z  40     1555   1555
LINK         SG  CYS A 215                 N   SER A 216     1555   1555



6) Insert the following footer in place of the END line:

CONECT 1775 1776

CONECT 1776 1775 

CONECT 2299 2405 2309 2339 

CONECT 2309 2299 

CONECT 2339 2299 

CONECT 2405 2299 

MASTER 498 0 1 9 11 0 1 6 2493 1 6 
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END 
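
Purely as an illustrative sketch, changes 1) to 3) of Table 2 may be applied to a list of PDB lines as shown below, and the separation between the SG atom of CYS 215 and the N atom of SER 216 may be computed from the replacement coordinates. The sketch assumes standard fixed-column PDB records; the helper names `atom_line`, `apply_changes_1_to_3` and `xyz` are hypothetical, and the input list is a toy example in which only the serial numbers are meaningful. Changes 4) to 6), including the TER record that itself carries serial 2298, are not handled.

```python
import math


def atom_line(serial: int, name: str = "CA", res: str = "GLY", res_seq: int = 1) -> str:
    """Build a toy ATOM line with the serial in columns 7-11 (hypothetical helper)."""
    return f"ATOM  {serial:5d}  {name:<3} {res} A{res_seq:4d}"


def apply_changes_1_to_3(lines, replacement, first=1770, last=1793):
    """Delete ATOM serials first..last, splice in `replacement`, renumber later ATOMs."""
    shift = (last - first + 1) - len(replacement)
    out, spliced = [], False
    for line in lines:
        if not line.startswith("ATOM"):
            out.append(line)           # non-ATOM records pass through untouched
            continue
        serial = int(line[6:11])
        if first <= serial <= last:
            if not spliced:            # emit the replacement block once, in place
                out.extend(replacement)
                spliced = True
            continue
        if serial > last:              # close the gap left by the shorter block
            line = f"{line[:6]}{serial - shift:5d}{line[11:]}"
        out.append(line)
    return out


def xyz(line):
    """Read x, y, z from columns 31-54 of an ATOM line."""
    return tuple(float(line[i:i + 8]) for i in (30, 38, 46))


# Toy input: one atom before the deleted range, the 24 deleted atoms, two after.
old = [atom_line(s) for s in (1769, *range(1770, 1794), 1794, 2309)]
new_block = [atom_line(s, res="CYS", res_seq=215) for s in range(1770, 1782)]
new = apply_changes_1_to_3(old, new_block)
new_serials = [int(entry[6:11]) for entry in new]

# Replacement records 1775 and 1776 from Table 2.
sg = "ATOM   1775  SG  CYS A 215      43.916  18.029  62.855  1.00 30.96"
n = "ATOM   1776  N   SER A 216      45.432  17.553  63.271  1.00 28.96"
sg_n_distance = math.dist(xyz(sg), xyz(n))  # in Angstroms
```

With twenty-four records deleted and twelve inserted, later ATOM serials fall by twelve, so that old ATOM 2309 becomes new ATOM 2297, in agreement with change 4). The computed SG to N separation is about 1.64 Angstroms, which is consistent with the LINK and CONECT records joining atoms 1775 and 1776.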



